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OM protein - protein search, using sw model 

Run on: December 12, 2003, 18:06:22 ; Search time 21 Seconds 

(without alignments) 
638.692 Million cell updates/sec 



Title : US-09-8 91-13 8A-2 

Perfect score: 1650 

Sequence: 1 MAQNL S CENWLATE AI LNKY REML1SKFRQYFKSLTSFRT 317 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 328717 seqs, 42310858 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing : Minimum Match 0% 

Maximum Match 100% . 
Listing first 4.5 summaries 



328717 



Database 



Issued_Patents__AA: * 
1 : /cgn2_6/ptodata/ l/iaa/5A__C0MB .pep: * 
2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 
3 : /cgn2_6/ptodata/l/iaaA6A_COMB.pep: * 
4 : /cgn2__6/ptodata/l/iaa/6B__COMB.pep: * 
/ cgn2_6 /p t odat a/l/iaa/ PCTUS_COMB . pep : 
/cgn2_6/ptodata/l/ iaa/backf ilesl . pep : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 

US-08-559-524A-2 

; Sequence 2, Application US/08559524A 

; Patent No. 5871963 

; GENERAL INFORMATION: 

APPLICANT : Conley, Pamela B. 

APPLICANT: Jantzen, Hans-Michael 

TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
; . NUMBER OF SEQUENCES : 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 

STREET: 1800 M Street, N.W. 

CITY: Washington 

STATE: D.C. 

COUNTRY : USA 
; ZIP: 20036-5869 

CCMPCJTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 



COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/559 , 524A 

FILING DATE: 15-NOV-1995 

CLASSIFICATION: 43 5 
ATTORNEY/ AGENT INFORMATION: 

NAME: Adler, Reid G. 

REGISTRATION NUMBER: 30,988 

REFERENCE/DOCKET NUMBER: 0444 81 - 5 0 10 - 0 0 -US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-467-7000 

TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO : 2 : 
SEQUENCE CHARACTERISTICS : 

LENGTH: 334 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-559-524A-2 

Query Match 74.3%; Score 1226.5; DB 2; Length 334; 

Best Local Similarity 71.5%; Pred. No. 4e-93; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I I i I IMM II Ilh hMI Ihlhl =1 lllllhll II 

Db 5 MAWNATCKNWLAAE AALE KY YL S I F YG I E F WGVLGNT I WYG Y I FS LKNWNS SN I YL FN 64 

Qv 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

: I hi I I Mill hi hill I MM II Mill III Mill MMMIhl Ilh: 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSII.FLTFISIDRYLII 124^ 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

,111:1 IIMIIIIIIIMM llllhlhl II I : h I hlllhl M 

Db 12 5 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

IMhIIIIIMIIII IMIMIh ^\\\--\--'-\ lllllhll IhMIIMhl 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

llllhllhllllll III III III : I I I I I I I llhlllhllhh 
Db 24 5 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 3 04 

Qv 300 ML I S KFRQY FKSLTS F 315 

lh = = I Illlill 

Db 30 5 MLMNQLRHNFKSLTSF 32 0 



RESULT 2 
US-08-749-^07-2 

; Sequence 2, Application US/08749707 

; Patent No. 6063582 

; GENERAL INFORMATION: 

APPLICANT: Conley, Pamela B. 



APPLICANT: Jantzen, Hans -Michael 

TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 

NUMBER OF SEQUENCES : 14 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORGAN , LEWIS & BOCKIUS LLP 

STREET: 18 00 M Street., N.W. 

CITY: Washington 

STATE : D . C . 

COUNTRY : USA 

ZIP: 20036-5859 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.3 0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/749 , 707 

FILING DATE: 15 -NOV- 1996 

CLASSIFICATION: 536 
ATTORNEY/ AGENT INFORMATION: 

NAME: Adler, Reid G. 

REGISTRATION NUMBER: 30,988 

REFERENCE /DOCKET NUMBER: 044481-5010 -01-US 
TELECOMMUNICATION INFORMATION : 
TELEPHONE: 202-467-7000 
TELEFAX: 202-467-7176 
; INFORMATION FOR SEQ ID NO : 2: 

SEQUENCE CHARACTERISTICS : 
; LENGTH: 3 34 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
T JS-03 -749-707-2 

Query Match 74.3%; Score 1226.5; DB 3; Length 334; 

Best Local Similarity 71.5%; Pred. No. 4e-93; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 1; 
Qy 1 MAQNLSCENWLATEAI LNKYYLS AFYAI EF I FGLLGNVT WFGYLFCMKNWNSSNVYLFN 60 



Db 



5 




Qy 



LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0' 



Db 



6 5 




Qy 



121 



KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0. 



Db 



125 




Qy 



18.1 



LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 




Qy 



241 



FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 



Db 



245 




Qy 300 ML I S KFRQYF KSLTS F 315 

I— I IMIIII 

Db 3 05 MLMNQLRHNF KS LTS F 320 



RESULT 3 

US-08-559-524A-4 

; Sequence 4, Application US/08559524A 

; Patent Mo. 5871963 

; GENERAL INFORMATION: 

APPLICANT: Conley, Pamela B. 
; APPLICANT: Jantzen, Hans -Michael 

TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
NUMBER OF SEQUENCES : 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
STREET: 1800 M Street, N.W. 
; CITY: Washington 

STATE : D . C . 
COUNTRY : USA 
ZIP : 20036-5869 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/559 , 524A 
FILING DATE: 15 -NOV- 1995 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Adler, Reid G. 
REGISTRATION NUMBER: 30,988 

REFERENCE /DOCKET NUMBER: 044481-5010 -00 -US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-467-7000 
TELEFAX: 2 02-467-7176 
INFORMATION FOR SEQ ID NO : 4: 
: SEQUENCE CHARACTERISTICS : 

; LENGTH: 3 73 amino acids 

TYPE: amino acid 
STPANDEDNESS : 
TOPOLOGY : 1 inear 
MOLECULE TYPE: protein 
US-08-559-524A-4 

Query Match 2 8.7'%; Score 4 74; DB 2; Length 3 73; 

Best Local Similarity 36.6%; Pred. No. 1.9e-31; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6 

Qy 2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

I I I M = I I I I I I : : : : ! | | | : : | | : | | | = : : | | : : | | | | | | 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 AND KG - T YGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLMKYPFREHFLQRKEFAI L I 13 8 

I =11 = I I = = I 1 I I I I I I ! I I I II = i I : 11=1= I 



Db 



112 FNKTDW I FGDAMCKLQRF I FHVNL YGS ILFLTC I SAHRYSG WYPLKS LGRLKKKNAVYI 171 



Qy 139 3LAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

h II =| : : hi : = = I = III Mhl h I hi 

Db 172 SVLVWLIVWGISPILFYSGTGIRKMKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

I h I • =111 || = = = =|=: = hhh = = hi 

Db 232 LILGCYGLIVRALIYKDLDMS PLRRKSIYLVI IVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D - S WPQGCT - QKAI KS I YTLTRPLAFLNS AINP I FYFLMGDHYREML 301 

I hi : : | :|| || |h :hl III II h I 

Db 289 D FQTPEMCAFNDRVYAT YQVTRGLAS LNS C VD P I L YFLAGDTFRRRL 3 35 



RESULT 4 
US-08-749-707-4 

Sequence 4, Application US/08749707 
Patent No. 6063582 
GENERAL INFORMATION: 

APPLICANT: Conley, Pamela B. 
APPLICANT: Jantzen, Hans -Michael 

TITLE OF INVENTION: NOVEL PURINERGIC RECEPTOR 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 
STREET: 1800 M Street N . W . 
CITY : Washington 
STATE: D.C. 
COUNTRY : USA 
ZIP : 20036-5869 
COMPUTER READABLE FORM : 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER : US/ 0 8 / 74 9 , 70 7 
FILING DATE: 15- NOV - 1 9 9 6 
CLASSIFICATION: 53 6 
ATTORNEY/ AGENT INFORMATION: 
NAME: Adler, Reid G. 
REGISTRATION NUMBER: 30,988 

REFERENCE/DOCKET NUMBER: 044481-5010 -01-US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE : 2 02-4 67-7000 
TELEFAX: 2 02-4 67-7176 
INFORMATION FOR SEQ ID NO : 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 373 amino acids 
TYPE: amino acid 
STRANDEDNESS : 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-749-707-4 

Query Match 28.7%; Score 474; DB 3; Length 373; 



Best Local Similarity 36.6%; Pred. No. 1.9e-31; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6 



Qy 2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

III I I = Ml III == = = | II h :||:|||:::|| == ||| || | 

Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I :|| =1 h: I Ml II I IN II II = II = Ih M I 

Db 112 FNKTDWI FGDAMCKLQRF I FHVTSTLYGS ILFLTCI SAHRYSGVVYPLKSLGRLKKKNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLI YSLCLTLLGFLIPLS 197 

h || :| : = |=| = •= = | = || | ||h | h I Ml 

Db 172 SVLVWLIVVVGISPILFYSGTGIRKNKTITCYDTTSDEYLRS YFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 25.6 

:: | :| | : : || | ||:: : =| = = : |:|:|: : : :|| 

Db 232 L I LGC YGL I VRAL I YKDLDNS PLRRKS I YLVI IVLTVFAVS YIPFHVMKTMNLRARL 288 

Qy 257 D - SWPQGCT - QKAIKS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 301 

I hi : : | :|| M III -II III II M I 

Db 2 89 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 33 5 



RESULT 5 

US-08-5.I3-974B-374 

Sequence 374, Application US/08513974B 
Parent No. 6114139 
GENERAL INFORMATION : 

APPLICANT: Hinuma , Shuj i 
APPLICANT: Hosoya, Masaki 
APPLICANT: Fujii, Ryo 
APPLICANT: Ohtaki , Tetsuya 
APPLICANT: Fukusumi , Sho j i 
APPLICANT: Ohgi , Kazuhiro 

TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
NUMBER OF SEQUENCES: 380 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: DIKE , BRONSTEIN , ROBERTS & CUSHMAN, LLP 
STREET: 130 Water Street 
CITY: Boston 
STATE : MA 
COUNTRY : USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/513 , 974B 
FILING DATE: 14 -SEP- 1995 
CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/ JP95/01599 
FILING DATE: 10 -AUG- 1995 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-093989 
FILING DATE: 19-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-057186 
FILING DATE: 16 -MAR- 1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-007177 
FILING DATE: 20-JAN-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6 326611 
FILING DATE: 28-DEC-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-2 70017 
FILING DATE: 02 -NOV- 1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236357 
FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA : 

APPLICATION NUMBER: JP 6-236356 
FILING DATE: 3 0 -SEP- 1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-1892 74 
FILING DATE: 11 -AUG- 1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189273 
FILING DATE: ll-AUG-1945 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189272 
FILING DATE: ll-AUG-1994 
ATTORNEY/ AGENT INFORMATION: 
NAME: Resnick, David S. 
REGISTRATION NUMBER: 34,23 5 
REFERENCE/DOCKET NUMBER: 45753 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-523-3400 
TELEFAX: 617-523-6440 
INFORMATION FOR SEQ ID NO : 3 74: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 362 amino acids 
TYPE: amino acid 
STRANDEDNESS : 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-513-974B-3 74 

Query Match 28.7%; Score 473.5; DB 3; Length 362; 

Best Local Similarity 35.9%; Pred. No. 2e-31; 

Matches 110; Conservative 57; Mismatches 122; Indels 17; Gaps 8 
Qy 10 WLA TEAILNK YYLS AF YAI EF I FGLLGNVTWFGYLFCMKNWNS SNVYLFN 60 

II MM Mi ! : II I Ml - -I M M =11 = 11 

Db 22 WAAGNATTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 

Qy 61 LS I SDFAFLCTLP I L I KS YANDKG - TYGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLL 119 

M::|i == Ml II II =111 = 1 I = = I Ml M II I I I M II 

Db 82 LALADFL YVLTLPAL I FYYFNKTDWI FGDVMCKLQRF I FHVNLYGS I LFLTC I S VHRYTG 141 



Qy 12 0 MKYPFREHFLQKKEFAI LIS LAVWALVTLE VLPMLTF IN - S VPKEEGSMC ID YAS SGNPE 178 

: =| = ||= |= =| Mill = hi = I = : I I = 

Db 14 2 WHPLKSLGRLKKKNAVYVSSLWALWAVIAPILFYSGIGVRRNKTITCYDTTADEYLR 2 01 



Qy 17 9 HNLIYSLCLTLLGFLIPLSWCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIF 2 37 

= 11 = 1 h I II h 1 =| | = = Ml || = = = =1 
Db 2 02 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKSTYLVIIVLTVF 2 58 



Qy 238 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295 

:= = hhh I = :||| III = = I =11 II III -II III II 

Db 259 AVS YL P FHVMKTLNLRARLD FQT PQMC AFNDKVYAT YQ VTRGL AS LNS C VD P I L YFL AGD 318 

Qy 2 96 HYREML 301 

= 1 I 

Db 319 TFRRRL 324 



RESULT 6 

US-09-102-710B-3 

; Sequence 3, Application US/09102710B 
; Patent No. 6479630 
; GENERAL INFORMATION: 

APPLICANT: Coleman, Roger 

APPLICANT: Au- Young, Janice 
; APPLICANT: Stuart, Susan G. 

; TITLE OF INVENTION : A NOVEL HUMAN PURINERGIC P2U RECEPTOR 
; FILE REFERENCE: PF-0038-1 DIV 

CURRENT APPLICATION NUMBER: US/ 0 9/ 102 ., 710B 
CURRENT FILING DATE: 1998-06-22 
; NUMBER OF SEQ ID NOS : 3 

SOFTWARE: PERL Program 
; SEQ ID NO 3 

LENGTH: 374 
TYPE : PRT 

ORGANISM: Rattus norvegius 
FEATURE : 

NAME/KEY: misc_feature 
OTHER INFORMATION: RNU09402 
US-09-102-710B-3 



Query Match 22.5%; Score 372; DB 4; Length 374; 

Best Local Similarity 33.8%; Pred. No. 4.3e-23; 

Matches 96; Conservative 49; Mismatches 125; Indels 14; Gaps 7 



Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWN3SNVYLFNLSISDFAFLCTLPILIK 77 

II I I = = II I I == =| =| I hi l = hl = = ll = =11 = 1 = 
Db 3 2 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 



Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

II I = III I- Hill I I I I II I h I I = I 

Db 92 YYAQGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRSLGVLRPLHSLRWGHARYAR 151 



Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

= =1111 hi h = = 11= | : || : | | :| 

Db 152 RVAAWWVLVLACQTPVLYFVTTSVRGTRITCHDTSDRELFSHFVAYSSVMLGLLFAVPF 211 



Qy 197 SVMCFFYYKMWFLKRRSQQQA TALPLDKPQ- -RLVVLAWIFSILFTPYHIMRNLR 251 

h= I | = || : | | || | : | : | : =| = : | |:|: | | 

Db 212 SIILVCY VLMARRLLKPAYGTTGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLY 267 

Qy 252 IASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 2 94 

= I I Mill H I II I II ::h Ml I 

Db 268 YSFR- -SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 309 



RESULT 7 

US-08-513-974B-373 

Sequence 373, Application US/08513974B 
Patent No. 6114139 
GENERAL INFORMATION: 

APPLICANT : Hinuma , Shu j i 
APPLICANT: Hosoya, Masaki 
APPLICANT: Fujii, Ryo 
APPLICANT: Ohtaki , Tetsuya 
APPLICANT: Fukusumi , Sho j i 
APPLICANT: Ohgi, Kazuhiro 

TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN , 
TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
NUMBER OF SEQUENCES: 3 80 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 
STREET: 13 0 Water Street 
CITY: Boston 
STATE : MA 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/513 , 974B 
FILING DATE : 14-SEP-1995 
CLASSIFICATION: 53 6 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/ JP95/01599 
FILING DATE: 10 -AUG- 1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-093989 
FILING DATE: 19-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-057186 
FILING DATE: 16-MAR-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-007177 
FILING DATE: 20 -JAN- 1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-326611 
FILING DATE: 28-DEC-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUM3ER: JP 6-270017 
FILING DATE: 02-NOV-1994 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236357 
FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236356 
FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189274 
FILING DATE: ll-AUG-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189273 
FILING DATE: 11 -AUG- 1945 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189272 
FILING DATE: ll-AUG-1994 
ATTORNEY/ AGENT INFORMATION: 
NAME: Resnick, David S. 
REGISTRATION NUMBER: 34,235 
REFERENCE/DOCKET NUMBER: 45753 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-52 3-3400 
TELEFAX: 617-523-6440 
INFORMATION FOR SEQ ID NO: 373: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 3 73 amino acids 
TYPE: amino acid 
STRANDEDNESS : 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-0S-513--974B-373 

Query Match 22.5%; Score 370.5; DB 3; Length 373; : 

Best Local Similarity 33.5%-; Pred. No. 5.7e-23; 

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5 

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : II II == =1 =1 I hi hhh = M = = lhh 
Db 3 2 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKIWNASTTYMFHLAVSDSLYAASLPLLVY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

II I : II! h = = 1111 MINI Ih I I = I = : I 

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCIS VHRCLGVLRPLHSLRWGRARYAR 151 

Qv 137 LIS LAVWALVTLEVLPMLTF INS VPKEEGSNC IDYAS SGNPEHNL I YSLCLTLLGFL I PL 196 

= =1111 | = | |: = : | | - | = || = | | =| 

Db 152 RVAAWWVLVLACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 

Qv 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQ- -RLWLAWIFSILFTPYHIMRNLRIAS 254 

Ih II I = = || | : | : | : =|:= I hh I I = 

Db 212 SVILVCYVLMARRLLKPAYGTTGGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLYYSF 2 71 

Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

II II II I =11111 II = = h Ml I 

Db 272 R--SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310 



RESULT 3 



US-08-153-848-28 

; Sequence 28, Application US/08153848 
; Patent No. 5759804 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
APPLICANT: Gray, Patrick W. 
APPLICANT: Schweikart, Vicki L. 
; TITLE OF INVENTION: No. 5759804el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES : 64 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 
ADDRESSEE: Bicknell 

STREET: 6300 Sears Tower, 233 South Wacker Drive 
/ CITY: Chicago 

STATE : Illinois 

COUNTRY : USA 

ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/153 , 84 8 

FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17 -NOV- 1992 
ATTORNEY /AGENT INFORMATION: 

NAME : No. 5759804and, Greta E. 

REGISTRATION NUMBER: 3 5,3 02 

REFERENCE/DOCKET NUMBER: 31794 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX: 25-3856 
INFORMATION FOR SEQ ID NO: 28: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 355 amino acids ■ 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-153-848-28 

Query Match 21.4%; Score 353; DB 1; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.5e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

= 11 ih = I 11=11= Ml I = = = = I I ll = = ll 1 = III 

Db 2 8 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76 I KS YANDKGTYGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLMKYPFREHFLQKKEFA 13 5 

I = I i = = =1 =111 = 111 = 1111: = = 

Db 88 THYLINEKGLH -NAMCKFTTAFFFIGFFGS I FFITVI S IDRYLAI VLAANSMNNRTVQHG 146 



Qy 13 6 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: III Ml I I I l = = : h II - := Illh 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

II =| : h = = : - = = | h =| = = l l = = l = = I I 1 = : I I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI KLILLWIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I =1 =1 := = = l =11 = =11= I h * I I =111 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312 



RESULT 9 

US-08-153-8.48-32 

; Sequence 32, Application US/08153848 

; Patent No. 5759804 

; GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 

APPLICANT: Gray, Patrick W. 

APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: No. 5759804el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES: 64 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O f Toole, Gerstein, Murray & 
; ADDRESSEE : Bicknell 

; STREET: 63 0 0 Sears Tower, 233 South Wacker Drive 

CITY : Chicago 
; STATE: Illinois 

COUNTRY: USA 

ZIP: 60606 
COMPUTER READABLE FORM: 
: MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/153 , 848 

FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17 -NOV- 1992 
ATTORNEY / AGENT INFORMATION: 

NAME: No. 5759804and, Greta E. 

REGISTRATION NUMBER: 35,302 

REFERENCE /DOCKET NUMBER: 31794 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX: 25-3856 
; INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 355 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 



US-08- 153 -848-32 

Query Match 21.4%; Score 3 53; DB 1; Length 355; 

Best Local Similarity 28.1%; Pred. Mo. 1.5e-21; 



Matches 84; Conservative 66; Mismatches 13 3; Indeis 16; Gaps . 6 

Qv 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

= 1111-1 I I I I : Ml I : : : : | | | h : I I hill 

Db 2 8 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76 IKSYANDKGTYGDVLCISMRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

| = || = = =| = II hi I hi IN : = = 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVIS1DRYLAIVLAANSMNNRTVQHG 146 

Qy 136 I L I SLAVWALVTLEVLPMIjTF IMS VPKEEGSMC I - D YASSGNPEHNL I YSLCLTLLGFL I 194 

: III III I I I |== : h I! == == lllh 

Db 147 VT I S LGVWAAAI L VAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLS^MCFFYYKMV-VFLKRRSQQQATALPLDKPQRLVVLAWIFSILFTPYHIMRMLRIA 253 

II H : |: = = = == = = | h =| = = | h = l = =lll= = l I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI KLILLWIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I =1 ■ =1 == -I =11 = =11= I 1= =1 I = II! 

Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312. 

RESULT 10 
US-09-299-843A-28 



Sequence 28, Application US/09299843A 
Patent No. 6107475 
GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 

APPLICANT: Gray, Patrick W. 

APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: No. 6107475el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES: 66 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 

ADDRESSEE: Borun 

STREET: 6300 Sears Tower, 233 South Wacker Drive 

CITY : Chicago 

STATE : Illinois 

COUNTRY: USA 

ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/2 99 , 84 3A 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER : US 09/088,337 

FILING DATE: 01-JUN-1998 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 

FILING DATE: 17 -NOV- 1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17 -NOV- 1992 
ATTORNEY /AGENT INFORMATION: 

NAME: Jill E. Uhl 

REGISTRATION NUMBER: 43,213 

REFERENCE/DOCKET NUMBER: 27866/32059B 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX : 

INFORMATION FOR SEQ ID NO: 28: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 3 55 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-299-843A-28 

Query Match 21.4%; Score 353; DB 3; Length 355; 

Best Local Similarity 28.1%; Pred.-No. 1.5e-21; 



Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6 

Qy IS ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

-II ! I - - I Ihlh II! I : =: = || ||::|| h Ml 

Db 20 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 37 

Qv 76 TKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135' 

| = || = = =| : II hi I hi III : 

Db 38 THYLINEKGLH - NAMCKFTTAFFF TGFFGS I F FITVI S IDRYLAI VLAANSMNNRTVQHG 146 

Qy 13b ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI -DYASSGNPEHNLIYSLCLTLLGFLI 194 

: Ml III I I I h= = h II - - IMh 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLSVltoFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

MM: | = ::= :: =:| |: :\::\ \::\ : : |l M : I | 

Db 203 PLLIMS YCYFRI IQTLFSCKNHKKAKAI KLILLVVIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I M M == = = l Ml : Mh I h M I = III 

Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL-— YHLYGKCL 312 



RESULT 11 
US-09-299-843A-32 

; Sequence 32, Application US/09299843A 
; Patent No. 6107475 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 

APPLICANT: Gray, Patrick W. 

APPLICANT: Schweikart, Vicki L. 

TITLE OF INVENTION: No. 6107475el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES: 66 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 
ADDRESSEE: Borun 

STREET: 6300 Sears Tower, 233 South Wacker Drive 

CITY : Chicago 

STATE: Illinois 

COUNTRY : USA 

ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/299 , 343A 

FILING DATE: 

CLASSIFICATION: 43 5 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 09/08.8,337 

FILING DATE: 01-JUN-1998 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 

FILING DATE: 17 -NOV- 1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17 -NOV- 1992 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Jill E. Uhl : 

REGISTRATION NUMBER: 43,213 1 

REFERENCE /DOCKET NUMBER: 27866/32059B 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX : 

INFORMATION FOR SEQ ID NO: 32: " 
SEQUENCE CHARACTERISTICS: 
LENGTH: 355 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-299-843A-32 

Query Match 21.4%; Score 353; DB 3 ; Length 3 55; 

Best Local Similarity 28.1%; Pred". No. 1.5e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; .Gaps-... 6 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

. = =11 ll = = I 11=1-1= III I = = = =|| || = :|| h ill 

Db 2 3 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76 I KS YANDKGTYGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLMKYPFREHFLQKKEFA 135 

Mi : = =1 = II 1=1 I I =IM I = 

Db 85 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHG 146 

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI -DYAS3GNPEHNLIYSLCLTLLGFLI 194 

■ III Ml I I I I = = • I = I I == == 1111 = 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 



Qy 195 PL55VMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 
Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI -KLILLWIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDS WPQGCTQKAI KS I YTLTRPLAFLNSAINPI F YFLMGDHYREMLI SKFRQYFKSL 312 

I =| =| == : = | =|| : =||= I |= =| I =111 
Db 257 KLYDFFPS CDMRKDLRLALS VTETVAFSHCCLNPL I YAFAGEKFRRYL YHLYGKCL 312 



RESULT 12 
US-09-088-337B-28 

; Sequence 28, Application US/09088337B 
; Patent No. 6348574 

GENERAL INFORMATION: 

APPLICANT: Godiska, Ronald 
; Gray, Patrick W. 

Schweikart, Vicki L. 
; TITLE OF INVENTION: No. 6348574el Seven Transmembrane Receptors 

NUMBER OF SEQUENCES : 66 

CORRESPONDENCE ADDRESS: 
/ ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 

; Bo run 

; STREET: 6300 Sears Tower, 233 South Wacker Drive 

CITY: Chicago 
; STATE : Illinois 

COUNTRY : USA 

ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE : Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA ■:■ 

APPLICATION NUMBER: US/09/088 , 337B 

FILING DATE: Ol-Jun-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,848 

FILING DATE: 17 -NOV- 1993 

APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17-NOV-1992 
ATTORNEY/ AGENT INFORMATION: 

NAME: No. 6348574and, Greta E. 

REGISTRATION NUMBER: 35,302 

REFERENCE/DOCKET NUMBER: 317 94 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 

TELEX: 25-3856 
INFORMATION FOR SEQ ID NO : 28: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 355 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
. SEQUENCE DESCRIPTION: SEQ ID NO : 2 8: 



US-09-G83-337B-28 



Query Match 21.4%; Score 353; DB 4; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.5e-21; 

Matches 84; Conservative 66; Mismatches 13 3; Indels 16; Gaps 6 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWMSSNVYLFNLSISDFAFLCTLPIL 75 

= 11 M- I 11 = 11 = Ml I = = = = ll ||::|| |: I I I 

Db 28 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76 I KS YANDKGTYGDVLC I SNRYVLHTNL YTS I LFLTF I SMDRYLLMKYPFREHFLQKKEFA 135 

hll = = =| = 11 hi Ihllll : = : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMMNRTVQHG 146 

Qy 13 6 I L I SLAVWAL VTLEVLPMLTF INS VPKEEGSNCI - D YAS SGMPEHNL I YSLCLTLLGFLI 194 

= Ml Ml I I I | = = = h M := == lllh 

Db 147 VTI SLGVWAAAI LVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 



Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

MM: |: = := :: ::| |: : | : : | | : : | : : || | : : | | 

Db 203 PLLIMSYCYFRI IQTLFSCKNHKKAKAI KLILLWIVFFLFWTPYNVMIFLETL 256 



Qv 254 SRLDS WPQGCTQKAI KS I YTLTRPLAFLNS AINP I F YFLMGDHYREML I S KFRQYFKSL 312 

I : I M - -I Ml = MM I MM I : I I I 

Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL— -YHLYGKCL 312. 



ESULT 13 

S-C9-088-337B-32 
Sequence 32, Application US/09088337B - 
Patent No. 6348574 

GENERAL INFORMATION: ^ 
APPLICANT; Godiska, Ronald 
Gray, Patrick W. 
Schweikart, Vicki L. 
TITLE OF INVENTION: No. 6348574el Seven Transmembrane Receptors 
NUMBER OF SEQUENCES: 66 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & 
Borun 

STREET: 6300 Sears Tower, 233 South Wacker Drive 
CITY: Chicago 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60606 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent-In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/088 , 337B 

FILING DATE: Ol-Jun-1998 

CLASSIFICATION : <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/153,843. 

FILING DATE: 17 -NOV- 1993 



APPLICATION NUMBER: US 07/977,452 

FILING DATE: 17 -NOV- 1992 
ATTORNEY/ AGENT INFORMATION: 

NAME: No. 6348574and, Greta E. 

REGISTRATION NUMBER: 3 5,3 02 

REFERENCE /DOCKET NUMBER: 317 94 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (312) 474-6300 

TELEFAX: (312) 474-0448 
; TELEX: 25-3856 

INFORMATION FOR SEQ ID NO : 32: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 3 55 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
US-09-088-337B-32 

Query Match 21.4%; Score 353; DB 4; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.5e-21; 



Matches 84; Conservative 66; Mismatches 13 3; Indels 16; Gaps 6 

Qy 16 ILNKY YLS AFYAI EF I FGLLGNVTVVFGYLFCMKNWNSSNVYLFNLS I SDFAFLCTLP I L 75 

: II ||:= I MMh III I : : = :|| ||::|| h Ml 

Db 2B VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Oy 7 6 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKJEFA 13 5 

| : | ! : : : | : | | | : | \ j : | | | | : : : 

.Cb S\s THYLINEKGLH-NA^ICKFTTAFFFIGFFGSIFFITVISIDRYLAIVLAAN"SMNNRTVQHG 146 

Qy 13 6 I L I SLAVWAL VTLEVLPMLTFINS VPKEEGSNC I - D YAS SGNPEHNL I YSLCLTLLGFL I 194 

= Ml IN I I ! I- : h I! :: == M|l : 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQE I WPVLRNVETNFLGFLL 202 

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWTjAWIFSILFTPYHIMRNLRIA 253 

M :| : |:::: ::::| |: : | : : | | : : | : : | | | : : | | 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI KLILLWIVFFLFWTPYNVMIFLETL 256 

Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312. 

I =| =| - =:| Ml = MM I MM I - III 

Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312 



RESULT 14 

US--09-170-496D-130 

; Sequence 130, Application US/09170496D 
; Patent No. 6555339 
; GENERAL INFORMATION: 

APPLICANT: Behan, Dominic P. 
; APPLICANT: Chalmers, Derek T. 

; APPLICANT: Liaw, Chen W. . i 

; TITLE OF INVENTION: No. 655333 9-.3ndogenous , Const itutively Activated. Human 

Protein- Coupled 

; TITLE OF INVENTION: Receptors 
; 5'ILE REFERENCE: AREN-0 04 0 

CURRENT APPLICATION NUMBER: US/09/ 1.7 0 , 4 96D 



; CURRENT FILING DATE: 1998-10-13 
NUMBER OF SEQ ID NOS: 294 
SOFTWARE: Patentln version 3.1 
; SEQ ID NO 13 0 
LENGTH: 3 55 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-170-496D-130 

Query Match 21.4%; Score 353; DB 4; Length 355; 

Best Local Similarity 28.1%; Pred. No. 1.5e-21; 



Matches 84; Conservative 66; Mismatches 13 3; Indels 16; Gaps 6; 

Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

= 11 ll = = I ! I = I I : Ml I = = = = || ||::|| h M! 

Db 2 8 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 37 

Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA: 135 

hll = = H * I I hi Ihllll : = = 

Db 88 THYL INEKGLH -NAMCKFTTAFFF IGFFGS I FF I TVI S IDRYLAI VLA ANSMNNRTVQHG 146 •; '."-V 

Qy 13 6 I L I SLAVWALVTLEVLPMLTF INS VPKEEGSNC I - D YAS SGNPEHNLI YSLCLTLLGFL I 194 

: Ml III I I I | = : = |: || :: :: ||||: 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAVVIFSILFTPYHIMRNLRIA 253. . V ' ^ , 

|| :| : ]:::: :: S = | !: : | : = j | : | : : | | | : : | | ^ 

Db 2 03 PLLTMSYCYFRIIQTLFSCKNHKKAKAI -- -KLILLWIVFFLFWTPYNVMIFLETL 256 - 

Qy /254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312.. ' ; 

! =| :| := ::| :|| : :||: | |: :| | = | | | 

Db . 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312 .-JM 



RESULT 15 

U3-09-170-496D-232 

; Sequence 232, Application US/09170496D 

; Patent No.. 655533 9 

; GENERAL INFORMATION: 

; APPLICANT: Behan, Dominic P. 

; APPLICANT: Chalmers, Derek T. 

; APPLICANT: Liaw, Chen W. 

TITLE OF INVENTION : No. 65553 3 9 -Endogenous , Constitutively Activated Human G 
Protein -Coupled 

; TITLE OF INVENTION: Receptors 
; FILE REFERENCE: AREN-0040 

; CURRENT APPLICATION NUMBER: US / 0 9/ 170 , 4 96D 

; CURRENT FILING DATE: 1998-10-13 

; NUMBER OF SEQ ID NOS: 2 94 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 232 

LENGTH: 355 

TYPE : PRT 

ORGANISM: Homo sapiens 
US--03- ~ 70 -496D-232 



Query Match 1 



21.4%; Score 353; DB 4; Length 355; 



Best Local Similarity 28.1%; Pred. No. 1.5e-21; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6 



Qy 16 ILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPIL 75 

= I I II- I 11 = 11= Mi I = = = = 11 ll = = ll 1= IN 

Db 20 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 

Qy 76" IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

1 = 11 : : :| =11 1 = 1 Ihllil = = = 

Db 88 THYL INEKGLH - NAMCKFTTAFFF IGFFGS I FF I TVI S IDRYLAI VLAANSMNNRTVQHG 146 

Qy 135 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

= 111111 I I I l = = : 1= II == == lllh 

Db 147 VTISLGVWAAAILVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

Qy 195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLR1A 253 

|| =| : |:::: :: : = | | : : | : : | | : : | : : | | | : : | | 

Db 203 PLLIMSYCYFRI IQTLFSCKNHKKA KAKKLILLWIVFFLFWTPYNVMIFLETL 256 

Qy 25 4 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 

I =1 =1 == ==l =11 = =11= I 1= =1 I =111 
Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312 



Search completed: December 12 , 2003, 18:09:39 
Job time : 22 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: December 12, 2003, 18:08:37 



; Search time 31 Seconds 
(without alignments) 
1901.833 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-09-891-138A-2 
1650 

1 MAQNLS CENWLATEAI LNKY REMLISKFRQYFKSLTSFRT 317 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 684280 seqs, 185983659 residues 

Total number of hits satisfying chosen parameters ; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



684280 



Database 



Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 

6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBC0MB .pep : * 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US0 8_PUBCOMB .pep: * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep : * 
10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB .pep : * 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB .pep : * 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: * 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB .pep : * 
14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep : * 
15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep : * 
16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB . pep : * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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ALIGNMENTS 



RESULT 1 

US-09-891-138A-2 

; Sequence 2, Application US/09891138A 

; Publication No. US20030083245A1 

; GENERAL INFORMATION: 

; APPLICANT: Lin, Daniel Chi -Hong 



APPLICANT : Zhao , Jiagang 
APPLICANT : Chen, Jin-Long 
APPLICANT: Cutler, Gene 
APPLICANT: Tularik Inc. 

TITLE OF INVENTION: No. US2 003 00 8324 5Alel Receptors 
FILE REFERENCE: 01878 1 - 0 062 10US 

CURRENT APPLICATION NUMBER: US/ 0 9/8 91 , 13 8A -r 
CURRENT FILING DATE: 2001-06-25 
PRIOR APPLICATION NUMBER: US 60/213,461 
PRIOR FILING DATE: 2000-06-23 
NUMBER OF SEQ ID NOS : 2 6 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 2 
LENGTH: 317 
TYPE: PRT 

ORGANISM: Mus musculus 
FEATURE : 

OTHER INFORMATION: mouse TGR18 G-protein coupled receptor (GPCR) 
US-09-891-138A-2 

Query Match 100.0%; Score 1650; DB 11; Length 317; 

Best Local Similarity 100.0%; Pred. No. 1.6e-141; 

Matches 317; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

^ 1 1 1 1 II I M: M I !l ! II 1 1 1 !l i Ih 1 1 1 1 1 1 1 1 IN m M 1 1 1 1 1 MM 

Db 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

Qy 61 LS I SDFAFLCTLP I L I KS YANDKGTYGDVLC I SNRYVLHTNLYTS ILFLTFI SMDRYLLM 120 

M I Ml M I ill 1 1 Ml 1 1 1 I Ml 1 1 ! 1 1 MM 1 1 M 1 1 M I M 1 1 II M 

Db 61 LS I SDFAFLCTLP I L IKS YANDKGTYGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLM 120 

Qy' 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

II I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I ! I I I I 
Db 121; KYPFREHFLQKKEFAI L I SLAVWALVTLEVLPMLTF INS VPKEEGSNC IDYAS SGNPEHN 180 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

1 1 1 T i 1 i I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 i 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 i 1 1 1 1 1 

Db 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

Qy 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

■ 1 1 1 1 1 M 1 1 III IMM Ml Ml 1 1 1 1 III Mill Mlllllllll 

Db 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

Qy 301 LISKFRQYFKSLTSFRT 317 

■ : III 
Db 301 LISKFRQYFKSLTSFRT 317 



RESULT 2 

US-10-225-567A-567 

; Sequence 567, Application US/10225567A 
; Publication No. US20030113798A1 
; GENERAL INFORMATION: 

APPLICANT: Lifespan Biosciences 
; APPLICANT: Brown, Joseph P. 

APPLICANT: Burmer, Glenna C. 



APPLICANT: Roush, Christine L. 

TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN -COUPLED 
RECEPTORS (GPCRS) 

FILE REFERENCE: 192 0-4-4 

CURRENT APPLICATION NUMBER: US/10/225 , 567A 

CURRENT FILING DATE: 2001-12-19 

PRIOR APPLICATION NUMBER: 60/257,144 

PRIOR FILING DATE: 2000-12-19 
; NUMBER OF SEQ ID NOS : 22 92 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 567 
LENGTH: 330 
TYPE : PRT 
; . ORGANISM: Homo sapiens 
US-10-225-567A-567 

Query Match 75.1%; Score 1238.5; DB15; Length 330; 

Best Local Similarity 72.2%; Pred . No. 3.3e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLS AF YAI EF I FGLLGNVT WFGYLFCMKNWNS SNVYLFN 60 

II I : I : I I I I II I I I I I I II I I h hill I hi hi MMIIIhlll! 
Db 1 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

II = 11 llllllhlhllll II MINIM IIIIMIIIIIIhlllh: 

Db 61 LSVSDLAFLCTLPMLIRSYANGNWI YGDVLCISNRYVLHANLYTS ILFLTFIS IDRYLI I 12 0 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

III I III MM III Ml 1 1 hi II 1 1 hi hi III : h I hill hi M 

Db 121 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 18 0 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

1 1 1 1 = I i i 1 1 1 1 1 1 1 I I II Ml I h MlhhM I MM hi I IhMIMIhl 

Db 181 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 240 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

I ! I ! M 1 1 : 1 1 1 1 1 I II I Ml I I I MIMMIII llhlllhllhh 

Db 241 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 300 

Qy 3 00 MLISKFRQYFKSLTSF 315 

lh = = I I Mill I 

Db 301 MLMNQLRHNFKSLTSF 316 



RESULT 3 

US-09-764-886-45 

; Sequence 45, Application US/09764886 

; Publication No. US2003 0139327A9 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 

FILE REFERENCE: PTZ02 

CURRENT APPLICATION NUMBER: US/ 0 9/764 , 886 
CURRENT FILING DATE: 2001-01-17 
; Prior application data removed - consult PALM or file wrapper 
NUMBER OF SEQ ID NOS: 8 8 



SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 4 5 
LENGTH: 334 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-764-886-45 

Query Match 75.1%; Score 1238.5/ DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.3e-104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

M I :|:|IM II I HIM M Mh hill I I = II : I hlilllhiill 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSI SDFAFLCTLP I L I KS YANDKGTYGDVLC I SNRYVLHTNL YTS ILFLTF I SMDRYLLM 120 

I I : I I I M I I I I : I h M I I I II I I I I I I M I I I I I I I I I I I II I I h I I I I = = 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

MM hi llllhlhl III = h I Ml M 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

MM M II Ml I II Ml II MM MMM-I Mill- IhhIMIhl 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

< I I I I M I M I I I I || I III I I I = 111111111 llhlllhllhh 

Db 24 5 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 

Qy 3 00 ML I S KFRQYFKS LTS F 315 

I I : : : I I I I I I I I 
Db 305 MLMNQLRHNFKSLTSF 320 



RESULT 4 

US-09-764-886-70 

Sequence 70, Application US/09764886 
Publication No. US20030139327A9 
GENERAL INFORMATION: 
APPLICANT: Rosen et al . 

TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
FILE REFERENCE: PTZ02 

CURRENT APPLICATION NUMBER: US/09/764 , 886 
CURRENT FILING DATE: 2001-01-17 

Prior application data removed - consult PALM or file wrapper 
NUMBER OF SEQ ID NOS : 8 8 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 70 
LENGTH: 3 34 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-764-886-70 



Query Match 75.1%; Score 1238.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.3e-104; 



Matches 22 8; Conservative 42; Mismatches 45; Indels 1; Gaps 1; 

Qy 1 MAQMLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 6 0 

II I HUM! II I Mill II llh hill 11 = 11 = 1 MIMIIMMM 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LS I SDFAFLCTLP I LI KS YANDKGTYGDVLC I SNRYVLHTML YTS I LFLTF I SMDRYLLM 12 0 

Ihll 1 1 1 M 1 1 = M = 1 1 i I IMIIIMMIIII II - - 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

mm: mmmmmmm mimmm mi : h i imiiim 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

MMMMMMM lllllllh MlhhM MUM 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ - GCTQKAIKS I YTLTRPLAFLNSAINPI FYFLMGDHYRE 2 99 

1 1 1 1 h 1 1 h 1 1 1 1 ! I II I III I I I =111111111 1 1 hi 1 1 hi I hi : 

Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 

Qy 300 MIiISKFRQYFKSLTSF 315 

Ih- I IIIIIM 

Db 3 05 MLMNQLRHNFKSLTSF 32 0 



RESULT 5 

US-10-272-983-36 

Sequence 36, Application US/10272983 
Publication No. US20030148450A1 
GENERAL INFORMATION: 
APPLICANT : Chen, Ruoping 
APPLICANT: Dang, Huong T. 
APPLICANT: Liaw, Chen W. 
APPLICANT: Lin, I-Lin 

TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
FILE REFERENCE: AREN0050 

CURRENT APPLICATION NUMBER: US/lO/272 , 983 
CURRENT FILING DATE: 2002-10-17 
PRIOR APPLICATION NUMBER: US/ 09/4 17 , 044 
PRIOR FILING DATE: 1999-10-12 
PRIOR APPLICATION NUMBER: 60/109,213 
PRIOR FILING DATE: 1998-11-20 
PRIOR APPLICATION NUMBER: 60/120,416 
PRIOR FILING DATE: 1999-02-16 
PRIOR APPLICATION NUMBER: 60/121,851 
PRIOR FILING DATE: 1999-02-26 
PRIOR APPLICATION NUMBER: 60/123,946 
PRIOR FILING DATE: 1999-03-12 
PRIOR APPLICATION NUMBER: 60/123,949 
PRIOR FILING DATE: 1999-03-12 
PRIOR APPLICATION NUMBER: 60/136,436 
PRIOR FILING DATE: 1999-05-28 
PRIOR APPLICATION NUMBER : 60/136,437 
PRIOR FILING DATE: 1999-05-28 
PRIOR APPLICATION NUMBER: 60/136,439 



PRIOR FILING DATE: 1999-05-28 
PRIOR APPLICATION NUMBER: 60/13 6,567 
PRIOR FILING DATE: 1999-05-28 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 74 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 3 6 
LENGTH: 334 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-272-983-36 

Query Match 75.1%; Score 1238.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.3e 104; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

M I :|:|lll II I Mill II Mh MM Ihlhl Mlllllhllll 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LS I SDFAFLCTLP I L I KS YANDKGTYGDVLC I SNRYVLHTNL YT$ I LFLTF I SMDRYLLM 12 0 

Ml 1 1 1 1 1 1 1 : M : I ! 1 1 IMIIIMIMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 : = 

Db 65 LS VSDLAFLCTLPMLIRS YANGNWI YGDVLC I SNRYVLHANLYTS I LFLTF I S IDRYLI I 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

I I II II I MMIIIIIIMhl llllhlhl III =1=1 MMM : I 
Db 12 5 KY P FREHLLQKKE FAI L I SLAI WVL VTL ELLPILPLIN P V I TDNGTTCND FAS S GD PN YN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

II I h II M II II I M IMIIIIh MlhhM lllllhll IhMllllhi 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

1 1 1 1 1 H I I = I I I I I I II I III I I I MIMIIMI Mhlllhllhh 
Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 



Qy 300 ML I S KFRQ YFKS LTS F 315 

lh::'l MIIMI 
Db 3 05 MLMNQLRHNFKSLTSF 32 0 



RESULT 6 

US-10-393-807-36 

; Sequence 36, Application US/10393807 
; Publication No. US20030175891A1 
; GENERAL INFORMATION: 
; APPLICANT: Chen, Ruoping 
; APPLICANT: Dang, Huong T. 
; APPLICANT: Liaw, Chen W. 
APPLICANT: Lin, I-Lin 

TITLE OF INVENTION: Human Orphan G Protein Coupled Receptors 
FILE REFERENCE: AREN0050 

CURRENT APPLICATION NUMBER: US / 10/393, 807 

CURRENT FILING DATE: 2003-03-21 
; PRIOR APPLICATION NUMBER: US/ 09/4 17 , 044 
; PRIOR FILING DATE: 1999-10-12 

PRIOR APPLICATION NUMBER: 60/109,213 



PRIOR 


t-i t t T TvT/"' T~v 7\ rn T-i 

FILING date 


1998 - 


1 T 
I 1 


- Z 0 


PRIOR 


APPLICAI ION 






bU/ ±ZU t 4-Lb 


PRIOR 


c 1L1NG DAIE 


1999 - 


Uz 


-16 


PRIOR 


APPL1CA1 ION 


>tt "nv/TTU CD 

NUMBEK 




C H / "1 O 1 OCT 

6 U / ±2,1 , obi 


PRIOR 


FILING DATE 


1999 - 


02 


-2 6 


PRIOR 


TV T*l T"1 T T f~*l TV m "T" /^vTVT 

APPLICATION 


NUMBER 




60/ 123 , 946 


PRIOR 


T"~i ~T T T" TVT/~* T— V 7\ rpi-i 

FILING DATE 


1999 - 


U 3 


-lz 


PRIOR 


APPLICATION 


NUMBER 


1 


60/ 123 , 949 


PRIOR 


FILING DATE 


1999- 


03 


-12 


PRIOR 


APPLICATION 


NUMBER 




60/136,436 


PRIOR 


FILING DATE 


1999- 


05 


-28 


PRIOR 


APPLICATION 


NUMBER 




60/136,437 


PRIOR 


FILING DATE 


1999- 


05 


-28 


PRIOR 


APPLICATION 


NUMBER 




60/136,439 


PRIOR 


FILING DATE 


1999- 


05 


-28 


PRIOR 


APPLICATION 


NUMBER 




60/136, 567 


PRIOR 


FILING DATE 


1999- 


05 


-28 



Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 74 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 3 6 
LENGTH: 3 34 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-393-807-36 

Query Match 75.1%; Score 123 8.5; DB 12; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.3e-104; 

Matches 22 8; Conservative 42; Mismatches 45; Indels 1; Gaps 
Qy 1 MAQNLSCENWLATEAI LNKYYLSAFYAI EF I FGLLGNVT WFGYLFCMKNWNS SNVYLFN 60 



Db 




Qy 



6 1 LS I SDFAFLCTL P I L I KS YANDKGT YGDVLC I SNRYVLHTNL YTS I LFLTF I SMDRYLLM 12 0 



Db 




Qy 



121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 



Db 




QY 



181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 



Db 




Qy 



241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 



Db 




Qy 



3 00 MLISKFRQYFKSLTSF 315 



Db 



3 05 MLMNQLRHNFKS LTS F 32 0 




RESULT 7 
US-09-765-034 



-2 



Sequence 2, Application US/09765034 
Patent No. US20020137887A1 
GENERAL INFORMATION: 
APPLICANT: Hedrick, Joseph A. 
APPLICANT: Lachowicz, Jean E. 
APPLICANT: Wang, Wei 
APPLICANT: Gustafson, Eric L. 
TITLE OF INVENTION: Adenosine Receptor 
FILE REFERENCE: CN01084 

CURRENT APPLICATION NUMBER: US/09/765 , 034 
CURRENT FILING DATE: 2001-01-17 
NUMBER OF SEQ ID NOS : 4 
SOFTWARE: PatentlnVer. 2.1 
SEQ ID NO 2 
LENGTH: 3 34 
TYPE : PRT 

ORGANISM: homo sapiens 
US-09-765-034-2 

Query Match 74.6%; Score 1231.5; DB 10; Length 334; 

Best Local Similarity 71.8%; Pred. No. 1.4e-103; 

Matches 22 7; Conservative 42; Mismatches 46; Indels 1; Gaps 1 
Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSl^JVYLFN 60 

il I MMIM II I Mill 1 1 Mh hill MM MMIIIhllll 

Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

I hi I ill MINIM II III I MM M II Ml Ml I II II MM I II- 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

II 1 1 1 II II M I II I II M h I II M h I h I II I MM hill hi M 

Db 12 5 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 134 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMVVFLKRRSQQQATALPLDKPQRLWLAWIFSIL 24 0 

I IMMMI Mill II MUM Ih MlhhM MM MMI IhMllllh 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVP 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

IMMMI hill Ml Ml Ml M Ml Mill III || hi 1 1 hi I hh 

Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 

Qy 300 MLISKFRQYFKSLTSF 315 

I h = M II I II II 
Db 3 05 MLMNQLRHNFKS LTS F 320 



RESULT 8 
US-10-270-587-2 

/Sequence 2, Application US/10270587 
; Publication No. US2003 0054487A1 
; GENERAL INFORMATION: 
; APPLICANT: Li, Yi 

TITLE OF INVENTION: Human G-Protein Coupled Receptor 

FILE REFERENCE: PF217C2 
; CURRENT APPLICATION NUMBER: US/10/270 , 587 



CURRENT FILING DATE: 2002-10-16 

PRIOR APPLICATION NUMBER: US 0 9/90 8,5 93 

PRIOR FILING DATE: 2001-07-20 

PRIOR APPLICATION NUMBER: US 08/781,456 

PRIOR FILING DATE: 1997-01-10 

PRIOR APPLICATION NUMBER: US 60/009,902 
; PRIOR FILING DATE: 1996-01-11 
; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH : 3 3 4 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-270-587-2 

Query Match 74.2%; Score 1224.5; DB 15; Length 334; 

Best Local Similarity 71.5%; Pred. No. 6.2e-103; 

Matches 226; Conservative 43; Mismatches 46; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I :|:||||.!l I Mill II llh hill Ihlhl :|lllllhllll 

Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

I hi I II MM hi hi II I MUM MM MM I II I II II II 1 1 h II I h : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISTDRYLII 124 

Qv 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

Illllll Mill Ml II I hi MM hi hi III = h I hi 1 1 hi M 

Db 12 5 KYPFREHLLQKKECAILISLAMWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184< 

Qy 181 L I YSLCLTLLGFL I PLS VMCFF YYKMWFLKRRSQQQATALPLDKPQRL VVLAVVI FS I L 240 

III MM III I III I II MM Ih MlhhM lllllhll IhMllllhl 

Db , 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

MllhMhlMMI II I III I I I MlhllMI Mhlllhllhh 

Db 245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPVAFLNSVINPVFYFLVGDHFRD 3 04 

Qy 3 00 ML I S KFRQYFKS LTS F 315 

I— I MIMM 
Db 3 05 MLMNQLRHNFKS LTS F 320 



RESULT 9 

US-09-764-886-80 

; Sequence 80, Application US/09764886 
; Publication No. US2003 0139327A9 
; GENERAL INFORMATION: 

APPLICANT: Rosen et al . 
; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 

FILE REFERENCE: PTZ02 
; CURRENT APPLICATION NUMBER: US / 09/764, 886 
; CURRENT FILING DATE: 2001-01-17 

Prior application data removed - consult PALM or file wrapper 

NUMBER OF SEQ ID NOS: 8 8 

SOFTWARE: Patentln Ver. 2.0 



; SEQ ID NO 8 0 

LENGTH: 2 88 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-09-764-886-80 



Query Match 69.2%; Score 1141.5; DB 12; Length 288; 

Best Local Similarity 72.6%; Pred. No. 1.7e-95; 

Matches 2 09; Conservative 40; Mismatches 38; Indels 1; Gaps 1 



Qy 2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 7 9 

I I I I II Ml: hill Ihlhl :|lllllh!llllhll MIHIMUMI 
Db 1 YYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFNLSVSDLAFLCTLPMLIRSY 60 



Qy 8 0 ANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 13 9 

II llllllllllllll MIMMMMIhlllhMMMM IIMIIIIIII 

Db 61 ANGNWI YGDVLC I SNRYVLHANL YTS I LFLTF I SIDRYLI I KYPFREHLLQKKEFAIL IS 120 

Qy 140 LAWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

Ihl llllhlhl III = h I hi II hi Mill hi MM I M III II 

Db 121 LAIWVLVTLELLP I LPL INPVITDNGTTCNDFAS SGDPNYNLI YSMCLTLLGFL I PLFVM 180 



Qy 2 00 CFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDSW 2 59 

INI:: :\\\:\::\ \\\\\\:\\ | | = M M M h I I M I h I I h I II I M II 
Db 181 CFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVLFTPYHVMRNVRIASRLGSW 240 

Qy 260 PQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFR 306 

I Ml I II 'MINIMI llhlllhllhhlh:: I 
Db 241 KQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRDMLMNQLR 2 88 



RESULT 10 . . 
US-09-764-886-79 

; Sequence 79, Application US/09764686 

; Publication No. US20030139327A9 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
FILE REFERENCE: PTZ02 

CURRENT APPLICATION NUMBER: US/09/764 , 886 
CURRENT FILING DATE : 2001-01-17 

Prior application data removed - consult PALM or file wrapper 
; NUMBER OF SEQ ID NOS : 88 

SOFTWARE: Patent In Ver . 2 . 0 
; SEQ ID NO 79 
LENGTH: 2 89 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-764-886-79 



Query Match 29.0%; Score 479; DB 12; Length 289; 

Best Local Similarity 37.3%; Pred. No. 1.8e-35; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps i 

Qy 20 YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 7 9 

Ml I I : II I Ml =: = = | |l 1= =11:111 = : = | | == IN II I 
Db 1 YYLPAVYILVFIIGFLGNS VAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 60 



Qy 80 AND KG - TYGDVLC I SNRYVLHTNLYTS I LFLTFI SMDRYLLMKYPFREHFLQKKEFAI L I 138 

I HIM | = = I III llllll II II = II : 11 = II 

Db 61 FNKTDW I FGDVMC KLQRF I FHVNLYGS I LFLTC I S AHRYSGWYPLKSLGRLKKKNAI YV 120 

Qy 13 9 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

|: || =| = = |:| = : : | : I I I I I hi h I III 

Db 121 SVLVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 180 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

= = | =| | = : || I |h= = =h= = hhh = = HI 

Db 181 L I LGC YGL I VRAL I Y KDLDNS PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 237 

Qy 257 D - S WPQGCT - QKAI KS I YTLTRPLAFLNS AINPI F YFLMGDHYREML 301 

I hi : = I :|| II III = = ll Ml II :| I 

Db 23 8 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 2 84 



RESULT 11 
US-09-991-225-5 

Sequence 5, Application US/09991225 
Publication No. US20030153063A1 
GENERAL INFORMATION: 
APPLICANT: Bri stol -Myers Squibb Company 

TITLE OF INVENTION: A NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY11> ■'■ 
IXPRESSED HIGHLY IN 

TITLE OF INVENTION: HEART AND VARIANTS THEREOF 
FILE REFERENCE: D0075.NP 

CURRENT APPLICATION NUMBER: US/0 9/ 991 , 225 * . . 
CURRENT FILING DATE: 2001-11-16 
PRIOR APPLICATION NUMBER: 60/249,613 
PRIOR FILING DATE: 2000-11-17 
PRIOR APPLICATION NUMBER : 60/257,611 
PRIOR FILING DATE : 2000-12-21 
PRIOR APPLICATION NUMBER: 60/305,818 
PRIOR FILING DATE: 2001-07-16 
NUMBER OF SEQ ID NOS : 81 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 5 
LENGTH: 373 
TYPE: PRT 

ORGANISM: homo sapiens 

US-C9-991-225-5 

Query Match 29.0%; Score 479; DB 12; Length 373; 

Best Local Similarity 37.3%; Pred. No. 2.5e-35; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps 6 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

III I I : II I IN == = = | II h = ll = llh= = ll == III I I I 

Db 52 YYLPAVYILVFI IGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy SO AND KG -TYGDVLC I SNRYVLHTNLYTS I LFLTFI SMDRYLLMKYPFREHFLQKKEFAI LI 138 

I =111 = 1 h: I III llllll II II = II : lh II 

Db 112 FNKTDW I FGDVMC KLQRF I FHVNLYGS I LFLTC IS AHRYSGVVYPLKSLGRLKKKNAIYV 171 



Qy 



13 9 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 



Db 172 SVLVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

:: I =| I = = II I I I = = = = I = = = I = I = I = = = : I I 

Db 232 L I LGCYGL I VRALI YKDLDNS PLRRKS I YLVI I VLTVFAVS Y I PFHVMKTMNLRARL 288 

Qy 257 D - SWPQGCT - QKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 3 01 

I hi : : I :|| II III = = || I I I II = 1 I 

Db 28 9 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 33 5 



RESULT 12 
US-10-010-568-6 

; Sequence 6, Application US/10010568 
; Publication No. US2 003 0157 5 98A1 
; GENERAL INFORMATION: 

APPLICANT: Br is tol -Myers Squibb Company 

TITLE OF INVENTION: A NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY2 3 , 
EXPRESSED HIGHLY IN 

TITLE OF INVENTION: KIDNEY 

FILE REFERENCE: D0077 NP 
; CURRENT APPLICATION NUMBER: US/ 10/ 010 , 568 

CURRENT FILING DATE: 2001-12-07 
: PRIOR APPLICATION NUMBER: US 60/251,926 
; PRIOR FILING DATE: 2000-12-07 

PRIOR APPLICATION NUMBER: US 60/269,795 

PRIOR FILING DATE: 2001-02-14 
; NUMBER OF SEQ ID NOS : 55 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 6 

LENGTH: 3 73 
TYPE : PRT 

ORGANISM: RATTUS NORVEGICUS 
US-10-010-568-6 

Query Match 2 9.0%; Score 47 9; DB 12; Length 3 73 ; 

Best Local Similarity 37.3%; Pred . No. 2.5e-35; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps 6 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

Ml I I : II I Ml == =:| II h :|| = |||= = = || == III II I 
Db 52 YYLPAVYILVFI IGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 AND KG - T YGDVLC I SNRYVLHTNL YTS I LFLTF I SMDRYLLMKYPFREHFLQKKEFAI L I 13 8 

I :|||:| I =■: I III III III II II = II = 11= II 

Db 112 FNKTDW I FGD VMCKLQRF I FHVNL YGS I LFLTC I S AHRYSGVVYPLKSLGRLKKKNAI YV 171 

Qv 1 3 9 1 SLAVWALVTLE VLPMLTF INS - VPKEEGSNC IDYAS SGNPEHNL I YSLCLTLLGFL I PLS 197 

[: M :| : : |:| : : . : | : I I I I I hi h I III 

Db 17:: -SVLVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 231 

Qy 193 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 



Db 232 LILGCYGLIVRAL I YKDLDNS PLRRKS I YLVI I VLTVFAVSY I PFHVMKTMNLRARL 288 



257 D - SWPQGCT - QKAI KS I YTLTRPLAFLNSAINP I FYFLMGDHYREML 301 



Db 



2 89 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 



RESULT 13 
US-10-092-135-7 

/ Sequence 7, Application US/10092135 
; Publication No. US2003 0054374A1 
; GENERAL INFORMATION: 

; APPLICANT: Bristol -Myers Squibb Company 

TITLE OF INVENTION: POLYNUCLEOTIDE ENCODING A NOVEL HUMAN G- PROTEIN COUPLED 
RECEPTOR, 

TITLE OF INVENTION: *■ HGPRBMY2 7 

FILE REFERENCE: D0134 .NP 
; CURRENT APPLICATION NUMBER: US/ 10/ 092 , 135 
; CURRENT FILING DATE: 2002-03-06 

PRIOR APPLICATION NUMBER: US 60/273,808 

PRIOR FILING DATE: 2001-03-07 

PRIOR APPLICATION NUMBER: US 60/278,983 
; PRIOR FILING DATE: 2001-03-27 

NUMBER OF SEQ ID NOS : 7 5 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 7 

LENGTH: 373 
TYPE: PRT . 

ORGANISM: RATTUS NORVEGICUS 
U5.'-10-092-135-7 

Query Match 29.0%; Score 479; DB 15.; Length 373; 

Best Local Similarity 37.3%; Pred. No. 2.5e-35; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps 6 
Qy 2 0 Y YL S AF YA I E F I FGLLGNVT VVFG YLFCMKN WNS SNVYL FNL 8 1 SD FAFLCTL P I L I KS Y 7 9 



Db 




Qy 



80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 13 8 



Db 



112 FNKTDWI FGDVMCKLQRF I FHVNLYGS I LFLTC I S AHR YSG VVYPLKSLGRLKKKNAI YV 171 




QY 



13 9 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 



Db 




Qy 



198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 2 56 



Db 




Db 



257" D- SWPQGCT - QKAI KS I YTLTRPLAFLNS AINP I FYFLMGDHYREML 3 01 

! hi : = I : 1 1 II 1 1 !- : : I i Mi II : I I 

2 89. DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 3 35 



RESULT 14 

US-10 -270-14-4.-4- 



; Sequenced, Application US/10270144 
; Publ i c a t i on No . US 2 0 03 0 04 97 90A1 



; GENERAL INFORMATION: 

; APPLICANT: WEI , Ming-Hui et al 

TITLE OF INVENTION: ISOLATED HUMAN G- PROTEIN COUPLED 

TITLE OF INVENTION: RECEPTORS , NUCLEIC ACID MOLECULES ENCODING HUMAN GPCR 
; TITLE OF INVENTION: PROTEINS, AMD USES THEREOF 
; FILE REFERENCE: CL000750CON 

CURRENT APPLICATION NUMBER: US/10/270 , 144 
; CURRENT FILING DATE: 2002-10-15 
; PRIOR APPLICATION NUMBER: 60/205,196 

PRIOR FILING DATE: 2000-05-18 
; NUMBER OF SEQ ID NOS : 7 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 4 

LENGTH : 2 99 
TYPE: PRT 

ORGANISM: Mus musculus 
US-10-270-144-4 

Query Match 28.8%; Score 475; DB 15; Length 2 99; 

Best Local Similarity 37.3%; Pred. No. 4.4e-35; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 7 



Qy 20 YYLS AF YAI EF I FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLP IL I KS Y 79 

Ml I I = M I Ml - -I M h =|h|||= = = || - III II I 
Db 11 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 70 

Oy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I Ml M h: I Ml MUM II II = II = 11= II : 

Db 71 B'NKTDWIFGDAMCKLQRFIFHWLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAIYV 13 0 

Qy 13 9 SLAVWAIjVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

|:. || :| = : |:| : : ■ I : I I h llhl h I Mi 

Db 131 SVLVWLIVWAISPILFYSGTGTRKNKTVTCYDTTSNDYLRSYFIYSMCTTVAMFCIPL- 189 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRIjVVLAVVIFSILFTPYHIMRIJI.RIASRL 256 

h I = = l I II I M = : : =l = = = hhh = = =11 

Db 190 VL I LGCYGL IV - - KALI YNDLDNS PLRRKS I YLVI I VLTVF AVS YIP FHVMKTMNLRARL 247 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 3 01 

I hi : : ! HI II III -ll III M :! I 

Db 24 8 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 2 94 



RES'ULT .15 
US-I0-010-568-5 

; Sequence 5, Application US/10010568 
; Publication No. US2003015759SA1 
; GENERAL INFORMATION: 

; APPLICANT: Bristol -Myers Squibb Company 

; TITLE OF INVENTION: A NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY2 3 , 
EXPRESSED HIGHLY IN 

TITLE OF INVENTION: KIDNEY 

FILE REFERENCE: D0077 NP 
; CURRENT APPLICATION NUMBER: US/ 10/ 0 10 , 568 
; CURRENT FILING DATE: 2001-12-07 

PRIOR APPLICATION NUMBER: US 60/251,926 

PRIOR FILING DATE: 2000-12-07 



PRIOR APPLICATION NUMBER: US 60/269,795 
PRIOR FILING DATE: 2001-02-14 
; NUMBER OF SEQ ID NOS : 55 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 5 

LENGTH: 37 3 
TYPE : PRT 

ORGANISM: MUS MUSCULUS 
US-10-010-568-5 



Query Match 2 8.8%; Score 475; DB 12; Length 373; 

Best Local Similarity 37.3%; Pred. No. 5.7e-35; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 7; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

III I i = II MM - ::| M h = I h I I h = = I I = = I I I II I 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 



Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I =11=1 h = I Ml MINI II M =11 = 11=11 = 

Db 112 FNKTDW I FGDAMCKLQRF I FHVNLYGS I LFLTC I S AHRYSG WYPLKSLGRLKKKNAI YV 171 



Qy 13 9 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

|: || =| = = | = | : : |. = | | h II M I h I II I 

Db 172 S VLVWL IWVAI S P I LF YSGTGTRKNKTVTCYDTTSNDYLRS YFI YSMCTTVAMFC I PL - 230 

Qy 198 VMCFFYYKMWFLKRRSQQOATALPL-DKPQRLWLAWIPSILFTPYHIMRNLRIASRIj 256 : . . . 

|: | ::| | || | \ \ : : : : \ : : : \ : \ , \ : : : : \ \ / 

Db 2 31 YL ILGCYGL IV - - KAL I YNDLDNS PLRRKS I YLVI I VLTVFAVS YI P FHVMKTMNLRARL 288- 

QV 2 5 7 D - SWPQGCT- QKA1KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 301 . 

I hi = I Ml M Ml Ml Mi ll M I J"' 

Db 289 DFQTPEMCDFNDRVYATYQVTRGLASIiNSCVDFILYFLAGDTFRRRL 33 5 



Search completed: December 12, 2 003, 18:14:04 
Job time : 32 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: December 12, 2003, 18:04:37 ; Search time 40 Seconds 

(without alignments) 
2045.067 Million cell updates/sec 



Title : US-09-891-13 8A--2 

Perfect score: 1650 

Sequence: 1 MAQNLS CENWLATE AI LNKY REMLI.SKFRQYFKSLTSFRT 317 

Scoring table: BL0SUM62 

Gapop 10.0 , Gapext 0 . 5 

Searched: 830525 seqs, 2580.52604 residues 

Total number of hits satisfying chosen parameters: 830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Vost -vVroe^ssing : Minimum Match 0% 

Maximum Match 100% 
Listing fir Ft 45 summaries 



vat abase : SPTREMBL 23:* 



1 




sp archea : * 


2 




sp bacteria : * 


3 




sp_f ungi : ' 


4 




sp human : * 


5 




sp invertebrate : * 


6 




sp mammal : * 


7 




sp_mhc : * 


8 




sp organelle:* 


9 




sp_phage : * 


10 


sp_j)lant : * 


11 


sp_roderit : * 


12 


sp virus : * 


13 


sp vertebrate:* 


14 


sp unclassified:* 


15 


sp rvirus : * 


16 


sp bacteriap:* 


17 


sp_archeap : * 



Pred. No. is the number of results predicted by chance to have a 
&cor£ greater than or equal to the score of the result being printed, 
and i*i derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


16 3 4 


99 


. 0 


Q 1 ""7 
31/ 


X ± 


y y yMi 6 


Q99mt6 mus musculu 




1238.5 


75 


. 1 


33 0 


4 


s~\ f~\ t-> -\r 7\ r— 

Q9BXA5 


Q9bxa5 homo sapien 


3 


1238.5 


75 


. 1 


3 34 


4 


Q8TDQ8 


Q8tdq8 homo sapien 


4 


4 82.5 


2 9 


. 2 


3 61 


13 


Q90X57 


Q90x57 xenopus lae 


5 


4 79 


2 9 


. 0 


3 73 


11 


Q8BMJ5 


Q8bmj5 mus musculu 


6 


461 


27 . 


. 9 


357 


13 


Q9DE05 


Q9de05 raja erinac 


7 


42 6 


2 5 . 


. 8 


3 37 


4 


Q96P68 


Q96p68 homo sapien 


8 


422 . 5 


25 . 


. 6 


374 


13 


057466 


057466 meleagris g 


9 


353 


21 . 


. 4 


361 


11 


Q8VHP3 


Q8vhp3 cavia porce 


10 


343 


20 . 


. 8 


360 


6 


Q8MJW8 


Q8mjw8 canis famil 


11 


338 


20 . 


. 5 


328 


11 


Q9ERK9 


Q9erk9 mus musculu 


12 


335 


2 0 , 


. 3 


355 


6 


Q9MYJ8 


Q9myj 8 callithrix 


13 


333 


20 . 


.2 


355 


11 


Q9JLY8 


Q9jly8 rattus norv 


14 


3 2 6.5 


19 , 


. 8 


354 


11 


Q8CBJ0 


Q8cbj0 mus musculu 


15 


326 


19 . 


. 8 


351 


6 


Q9MYJ9 


Q9myj9 oryctolagus 


16 


325.5 


19 , 


. 7 


354 


11 


Q8BR50 


Q8br50 mus musculu 


17 


323 . 5 


19 . 


. 6 


360 


11 


Q91ZH4 


Q91zh4 rattus norv 


18 


321 


19 . 


. 5 


344 


11 


Q8BMC0 


Q8bmc0 mus musculu 


19 


313.5 


19 . 


. 3 


309 


11 


Q8R528 


Q8r52 8 mus musculu 


2 0 


317.5 


19 . 


. 2 


355 


6 


Q8HZN7 


Q8hzn7 gorilla gor 


2 1 


316.5 


19 . 


. 2 


355 


6 


Q95LH1 


Q951hl macaca fasc 


22 


316.5 


19 . 


. 2 


3 85 


11 


Q8BQ97 


Q8bq97 mus musculu 


2 3 


316 


19 . 


. 2 


367 


13 


Q9PVG0 


Q9pvg0 carassius a 


24 


315 


19 . 


. 1 


355 


4 


Q9BYX5 


Q9byx5 homo sapien 


25 


314 


19 . 


. 0 


356 


4 


Q96TF2 


Q96tf2 homo sapien . 


2 6 


313 


19 . 


. 0 


355 


11 


QBBVW4 


Q8bvw4 mus musculu 


2 7 


313 


19 . 


. 0 


3 55 


11 


Q8BMH9 


Q8bmh9 mus musculu 


2 8 


313 


19 , 


. 0 


367 


13 


Q9PVF9 


Q9pvf9 carassius a 


29 


312.5 


18 . 


. 9 


346 


4 


Q96GE0 


Q96ge0 homo sapien 


30 


312 . 5 


18 . 


. 9 


355 


4 


Q8IUZ1 


Q8iuzl homo sapien 


31 


312 . 5 


18 . 


, 9 


355 


6 


Q8HZN8 


Q8hzn8 pan troglod 


3 2 


3 12.5 


18 . 


, 9 


3 90 


13 


Q8QGQ4 


Q8qgq4 carassius a 


33 


310.5 


18 . 


8 


339 


4 


Q8N5S7 


Q8n5s7 homo sapien • 


34 


310 


18 . 


8 


370 


11 


Q8BKK1 


Q8bkkl mus musculu 


3.5 


3 0 9.5 


18 . 


, 8 


355 


6 • 


Q8HZN3 


Q8hzn3 papio. hamad 


36 


308 . 5 


18 . 


7 


355 


6 


Q8HZN4 


Q8hzn4 cercopithec 


37 


3 08 


18 . 


7 


346 


11 


Q91Y73 


Q91y73 mus musculu 


38 


307.5 


18. 


6 


390 


13 


Q8AXM7 


Q8axm7 carassius a 


3 9 


307 


18. 


6 


362 


4 


Q8NE10 


Q8nel0 homo sapien 


40 


307 


18. 


6 


375 


11 


Q8BYI1 


Q8byil mus musculu 


41 


306 . 5 


18. 


6 


355 


6 


Q8HZN6 


Q8hzn6 pongo pygma 


42 


306 .5 


18. 


6 


3 92 


6 


Q9BDQ4 


Q9bdq4 canis famil 


43 


306 


18 . 


5 


337 


11 


Q921N3 


Q921n3 mus musculu 


44 


306 


18 . 


5 


370 


11 


Q8BLG2 


Q8blg2 mus musculu 


45 


3 04 


18 . 


4 


343 


13 


Q8JJ2 3 


Q8 j j 2 3 oncorhynchu 



ALIGNMENTS 



RESULT 1 
Q99MT6 

ID Q9.9MT6 PRELIMINARY 
AC Q99MT6; 

.DT 01-JUN-2001 (TrEMBLrel . 
DT 01-JUN-2001 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 



PRT; 317 AA. 
17, Created) 

17 7 Last sequence update) 
23, Last annotation update) 



DE G-protein coupled receptor GPR91 . 

GM GPR91. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI__TaxID=10 0 90; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=C57BL; 

RX MEDLINE=21172992 ; PubMed=11273702 ; 

RA Wittenberger T., Schaller H.C., Hellebrand S . ; 

RT "An expressed sequence tag (est) data mining strategy succeeding in 

RT the discovery of new g-protein coupled receptors."; 

RL J. Mol. Biol. 307:799-813(2001). 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

DR EMBL; AF295367; AAK01867.1; -. 

DR MGD ; MGI: 1934135; Gpr91. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR0023 7; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SO SEQUENCE 317 AA; 36701 MW; 4E22F0608F92 8E3 6 CRC64 ; 

Query Match 99.0%; Score. 1634; DB 11; Length 317; 

Best Local Similarity 99.1%; Pred. No. 8.6e-132; 

Matches 314; Conservative 1; Mismatches 2; Indels 0; Gaps 0: 
Qy ; '1 MAQNLS CENWLATEAILNKYYLS AF YAI EF I FGLLGNVTWFGYLFCMKNWNS SNVYLFN 60 

i 1 1 ! i r 1 1 1 1 1 1 1 1 1 1 1 f [ 1 1 1 1 1 m 1 1 1 1 1 m f 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 f 1 1 f 

Db 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLG^XVTWFGYLFCMKNWNSSNVYLFN 60 

QY 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

MlilMlllilMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIhl II 1 1 Ml INI 

Db 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSMLLLTVISMDRYLLM 120 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

MMIIIIIIIIIIIIIIIIIIMMIIMIIIIMIIIMIMIII MINIM Mill 

Db 121 KYPFREHFLQKKEFAILISLAWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

MM I I IMMMMMIMMIM II MMIM II III IM Mill 1 M IMM MMI 

Db 101 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLICRRSQQQATALPLDKPQRLWLAWIFSIIj 24 0 

QY 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

I M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

Qy 301 LISKFRQYFKSLTSFRT 317 

1 1 II M M M 1 1 ! M ! I 

Db 3 01 LISKFRQYFKSLTSFRT 317 



RESULT 2 
Q9BXAS 



ID Q-9BXA5 PRELIMINARY; PRT; 3 30 AA. 

AC Q9BXA5 ; 

DT 01-JUN--2001 (TrEMBLrel- 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17.. Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE G-protein coupled receptor 91. 

GN GPR91. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21172992; PubMed=l 12 7 3 702 ; 

RA Wittenberger T. , Schaller H . C . Hellebrand S.; 

RT "An expressed sequence tag (est) data mining strategy succeeding in 

RT the discovery of new g-protein coupled receptors."; 

RL J. Mol. Biol. 307:799-813(2001). 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 
CC SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

DR EMBL; AF348078; AAK29080.1; -. 

DR InterPro; I PRO 0 0276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PRO 023 7; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

r Cd G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 330 AA; 38283 MV? ; 6D3C70E340866BAB CRCS4; 

Query Match 75.1%; .Score 12 3 8.5; DB 4; Length 330; 

Best Local Similarity 72.2%; Pred. No. 5.7e-98; 
Matches 2.28; Conservative 42; Mismatches 45; Indels I; Gaps 1 

Qv 1 MAQNLSCENT^IjATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II ! M M II I II I Mill II I I h l-M! Ihlhl • M I ! I I h I I I I 

Db 1 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 50 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

M = f I MMIIhlhiMI IIIMMIIIIIM 1 1 1 J I 1 1 1 1 1 1 1 h 1 1 1 h = 

Db 61 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 12 0 

Qy -21 KYPFREHFLQKKEFAILISLAVWALYTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0. 

Mlllll MIIIIMIIMhl llllhlhl M I = h I hlllhl ■ I 

Db 121, KYPFREHLLQKKEFAXLISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 180 

Qv LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 24 0 

Mlhll Mill MM MM Mlh MlhhM lllllhll I MM MM hi 

Db 181 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 24 0 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

IIMhilhlHill II I Ill I II HIIMIIII 1 1 1 = I II I = I II = I = 

Db 241 FTPYm/MRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 300 

Qy 3 00 MLISKFRQYFKSLTSF 315 

lh- I Mill II 
ub 301 MLMNQLRHNFKSLTSF 316 



RESULT 3 
Q8TDQ8 



ID Q8TDQ8 PRELIMINARY; PRT; 334 AA. 

AC Q8TDQ8 ; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE P2Y purinoceptor 1 (G protein- coupled receptor 91) . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I JTaxID= 9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang W., Li N. , Wan T., Cao X.; 

RT "Human P2Y purinoceptor 1 . " ; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TI3SUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

CC -i- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G- PROTEIN COUPLED RECEPTORS. 

DR EMBL; AF247785; AAL95690.1; -. . 

DR EMBL; BC03 0 94 8; AAH30948.1; -. 

DR Genew; HGNC:4542; GPR91. ■ 

DR Inter Pro; IPR0 002 76; GPCR_ Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. - 

DR PROSITE; PS0Q237; G_PROTEIN_RECEP_jta_.l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 334 AA; 38697 MW; 33 146E1AD87F0E8 1 CRC64 ; ' 

Query Match ■ 75.1%; Score 1238.5; DB 4; Length 334; 

Best Local Similarity 72.2%; Pred. No. 5.8e-98; 
Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNL S CENWLATE A I LNKY YLS AF YAI E F I FGLLGNVT WFG YL FCMKNWNS SNVYLFN 60 

II I :|:|||| II I Mill II III: hill I h I I = I = I I I I I I I = II I I 

Db f> MAWNATCKNWL AAE AALE KY YL S I F YG I E F WGVLGNT I WYG Y I F S LKNWNS SN I YL FN 64 

Qv 61.. LS I SDFAFLCTLP I L I KS YANDKGTYGD VLC I SNRYVLHTNL YTS ILFLTF I SMDRYLLM 120 

I Ml Illlllhlhllll I I II I INI I Ml I llllllllllllhlllh: 

Db 65 LS VSDLAFLCTLPML IRS YANGNW I YGD VLC I SNRYVLHANLYTS ILFLTF I S IDRYL I I 124 

Qy 121 KYPFREHFLQKKE FAI L I SLAVWALVTLE VLPMLTF INS VPKEEGSNC ID YAS SGNPEHN 18 0 

III I Ml I MM Ml Mllhl IIMhlhl III : h I hlllhl =1 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

0y 10 1 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

IMhll II I Ml III MM III h Mlhh:! I Mllhl I IhMMIIhl 

Db 185' LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

QV 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

M I I h II h I II I I i I I I III I I ! M I I I I II II II h I I I h 1 1 h h 



Db 



245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 0,4 



Qy 3 00 MLISKFRQYFKSLTSF 315 

11= = = I IMIIII 
Db 3 05 MLMNQLRHNFKSLTSF 3 20 

RESULT 4 
Q90X57 

ID Q9 0X57 PRELIMINARY; PRT; 3 61 AA. 

AC Q90X57; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE P2Y1 nucleotide receptor. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pi-poidea; Pipidae ,- 

CC Xenopodinae ; Xenopus . 

OX NCBI_TaxID=83 55; 

RN [1] 

RP SEQUENCE FROM N. A. 

RA Cheng A.W., Tsim K.W. ; 

RT "Cloning of Xenopus P2Y1 Receptor. "; 

RL Submitted (OCT-2001) to the . EMBL/GenBank/DDBJ databases. 

DR EMBL; AF432354; AAL27614.1; 

DR InterPro"; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_F.l_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 361 .AA; 41002 MW; E5B2D605F5B57FED CRC64 ; 

Query Match 29.2%; Score 482.5; DB 1,3; Length 361; 

Best Local Similarity 36.7%; Pred. No. 2.6e-33; 

Matches 114; Conservative 58; Mismatches 124; Indels 15; Gaps 8; 

QV 1 MAQNLS CENWL ATE A I LNK YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSN 55 

•'I I I h II II! I I : | | :|| - : = | || "|:| = 

Db 18" LASGSSAGN- - VTKCSLTKTGFQFYYLPAVYIWCITGFIGNS VAIWMFIFHMKPWSSIS 75 

Qy 56 VYLFNLS I SDFAFLCTLP IL I KS YANDKG - TYGDVLC I SNRYVLHTNL YTS I LFLTF I SM 114 

lhiM:::|| ===111.1 I I =1111 I = = I I I I I II I I I I I = 

Db 76 VYMFNLALADFLYVLSLPALIFYYFNKTDWIFGDALCKLQRFLFHVNLYGSILFLTCISV 135 

Qy 115 DRYLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS - VPKEEGSNCIDYAS 17 3 

I I = =1 = Ih : I I I | | = | : |:| | : : | : | | :| 

Db 13 6 HRYTGWHPLKSLGRLKKKNSIYISALVWFIVIAGISPILFFSGTGIRKNKTITCFDTSS 195 

Qy 174 -SGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKP.Q-RLWL 232 

111 = 11=1111 == I :| I = I || I Ih: 
Db 196 DEYLRSYFIYSMCTTVFGFCIPFILILGCYGLIVRALIYKDMNWA ■ PLRKKSIYLVII 252 

Qv 233 AWI FS ILFTPYHIMRNLRIASRLD - SWPQGCT -QKAIKS IYTLTRPLAFLNSA INP IFY 2 90 

= =l = = = hh hi I = =111 hi = = I =11 II III = = || I 

Db 2 53 VLTVFAVSYLPFHVMKNLNLRARLDFQSPEMCNFNDRVYATYQVTRGLASLNSCVDPILY 312 



Qy 2 91 FLMGDHYREML 3 01 

II II H I 
Db 313 FLAGDTFRRKL 32 3 



RESULT 5 
Q8BMJ5 



ID Q8BMJ5 PRELIMINARY; PRT; 3 73 AA. 

AC Q8BMJ5 ; 

DT 01-MAR 2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE P2Y purinoceptor 1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID= 10 0 90; 

RN [1] 

RP SEQUENCE FROM N. A. 

RC STRAIN=C57BL/6J; TISSUE-Body; 

RX MEDLINE=22354683 ; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

P.T :f Analysis of the mouse transcriptome based on functional annotation cf 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR SMBL; AK030759; BAC27125.1; -. 

3Q SEQUENCE 373 AA; 42228 MW; BA38124B7347287C CRC64 ; 

. Query Match 29.0%; Score 479; DB 11; Length 373; 

Best Local Similarity 37.6%; Pred. No. 5.3e-33; 

Matches 108; Conservative 54; Mismatches 117; Indels 8; 'Gaps < 7 

Qy. 2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNV'TLFNLSISDFAFLCTLPILIKSY 7 9 

' III I I : II TIM - hi II h =|| = ||h: = || := [|| || | 
Db 52 YYLPAVYILVFIIGFLGNSVAIWMYVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 13 8 

I HI H h: I III MINI II II = II = Ih II 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAIYV 171 

Qy 139 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

h II =| = : hi : : I = II h I I hi h I III 

Db 17 2 SVLVWLTVWAISPILFYSGTGTRKNKTVTCYDTTSNDYLRSYFIYSMCTTVAMFCIPL- 23 0 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

h I = = | I II |. || = : = =| = = : hhh : =- = || 

Db 231 VL I LGC YGL IV - - KAL I YNDLDNS PLRRKS I YLVI I VLTVFAVS Y I PFHVMKTMNLRARL 288 

Qvi . 2 57 D - S WPQGCT - QKAI KS I YTLTRPLAFLNS AINP I FYFLMGDHYREML 3 01 

! hi = : | =|| || Ml ::|| ||| || :| | 

Db 28 9 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 



RESULT 6 
Q9DE0 5 



ID Q9DE05 PRELIMINARY; PRT ; 357 AA. 

AC Q9DE05; 

DT 01-MAR-2001 {TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel . 16, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE P2Y receptor. 

OS Raja erinacea (Little skate) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes ; 

OC Elasmobranchii ; Squalea; Hypnosqualea; Pristioraj ea; Batoidea; 

OC Rajiformes; Rajidae; Raja. 

OX NCBI_TaxID=7782; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=2 04 5 9151; PubMed=10 9002 00 ; 

RA Dranoff J. A. , O'Neill A.F., Franco A.M., Cai S.Y., Connolly G.C., 

RA Ballatori N. , Boyer J.L., Nathanson M.H. ; 

RT "A primitive ATP receptor from the little skate Raja erinacea. "; 

RL J. Biol. Chem. 275:30701-30706(2000). 

DR EMBL; AF242850; AAG42684.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PRO 023 7; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1__1 ; 1 . 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1 2 ; 1. 

KW Receptor. 

SQ SEQUENCE 357 AA; 41239 MW; 14604EE15DCBDB4 1 CRC64 ; 

Query Match 2 7.9%; Score 461; DB 13; Length 3 57; 

Best Local Similarity 35.0%; Pred. No. 1.8e-31; 

Matches 110; Conservative 64 ;■ Mismatches 112; Indels 28; Gaps 11; 

Qy 17 LNK YYLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTL 72 

II! Ill I I h I :|| :: ::| |: |:| :|:||| : = | :: :| 

Db 2 9 LNKGFQFYYLP IMYI I VFVTGF IGNS VALWMF I FHMRPWS S I T I YMFNLVLADLF YVFSL 88 

Qy 73 PILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQK 131 

I I I I I I :|::M h = I Ml MINI Ih II = M = I 

Db 89 PILIFYYFNKTDWIFGELLCKLXRFIFHVNLYGSILFLTCISVHRYTGWHPMKSLGRLK 148 

Oy 132 KEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIY3LCLTLL 190 

I = I : = = I I I = | = I I = = = : | | | | | | | : | 

Db 149 KKS ATI VC VCVVJ ITVMAG I S P IL YFSRTGLRRNKTNTC YDTTSKELLET YF I YSMSTTFF 208 

Ov 191 GFLIPLS - - VMCF - FYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFS ILFTPYHIM 247 

|| || =' ==|= I | =: | II | |||:: : :|:| : |:|:| 

Db 209 GFCIPFATILVCYGFIVKALI SNDMKTPL-RGKSVRLVIIVLAVFAISYLPFHVM 262 

Qy 248 RNLRIASRLDSWPQG CT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLIS 303 

Ml = Ml : II I = : = I Ml M III = = || III II M 
Db 263 KNLNLQSRL - - YYQGLDTCEWNRRVY ATYQVTRGLAS LNS CVDP I LYFLAGDTFR 315 



O.Y 
Db 



3 04 KFRQYFKSLTSFRT 317 

| = = : : | | 
316 - -RRFTNAASRFMT 32 7 



RESULT 7 
Q96P68 

ID Q96P68 PRELIMINARY; PRT; 337 AA. 

AC Q96P68; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE G protein-coupled receptor GPR99 (Putative G-protein coupled receptor) 

DE (Seven transmembrane helix receptor) . 

GN GPR80 OR GPCR OR GPR99 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX MCB I_Tax I D = 9 6 0 6 ; 

RM [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21458557; PubMed-11574155 ; 

RA Lee D.K., Nguyen T., Lynch K.R., Cheng R . , Vanti W.B., Arkhitko 0., 

RA Lewis T. , Evans J.F., George S.R., O ■ Dowd B.F.; 

RT "Discovery and mapping of ten novel G protein- coupled receptor. 

RT genes . " ; 

RL Gene 275:83-91(2001). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Takeda S., Kadowaki S., Haga T. , Takaesu H. , Mitaku 3.; 

RT "Identification of G protein-coupled receptor genes from the human 

RT genome sequence . " ; 

RL Submitted (APR-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX ?ubMed=12098360; 

PA Wittenberger T., Hellebrand S., Munck A.; Kreienkamp H.J., 

RA Schaller H.C., Hampe W.; 

RT "GPR99 , a new G protein-couplet! receptor belonging to a new subgroup 

RT of nucleotide receptors."; 

RL BMC Genomics 3:17-17(2002). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Suwa M. , Sato T. , Okouchi I., Arita M. ; Futami K. , Matsumoto S., 

RA Tsutsumi S., Aburatani H., Asai K. , Akiyama Y.; 

RT -'Genome-wide discovery and analysis of human seven transmembrane helix 

RT receptor genes . " ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF411109; AAL26480.1; -. 

DR EMBL; AB083598; BAB89311.1; -. 

DR EMBL; AF370886; AAM76912.1; -. . 

DR EMBL; AB065877; BAC06095.1; 

DR Genew; HGNC:4531; GPR8 0 . 

DR InterPro; IPR000276; GPCR__Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PRO 023 7; GPCRRHODOPSN . 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor; Transmembrane. 

SQ SEQUENCE 337 AA; 38251 MW; 6814EA0044756CE6 CRC64 ; 



Query Match 



25.3%; Score 426; DB 4; Length 337; 



Best Local Similarity 34.4%; Pred. No. 1.6e-28; 

Matches 101; Conservative 57; Mismatches 126; Indels 10; Gaps 6; 



Qy 17 LNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILI 76 

I :|| I I h I II |= hi h i II = = Ih =1 = 1 HI II 
Db 31 LKMHYLPVIYGIIFLVGFPGNAWISTYIFKMRPWKSSTIIMLNLACTDLLYLTSLPFLI 90 

Qy 77 KSYAN-DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

Ih = :|| =1 h I llhllllll h II =: =1 = I I 

Db 91 HYYASGENWI FGDFMCKFIRFSFHFNLYSS ILFLTCFS I FRYCVI IHPMSCFS IHKTRCA 150 

Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLI - - YSLCLTLLGFL 193 

I I = : I : I I II = I hi II I I I hi II I 
Db 151 WACAWWIISLVAVIPMTFLITSTNRTNRSACLDLTSS - -DELNTIKWYNLILTATTFC 208 

Qy 194 IPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

:|| :: | :: | | : | : | | : | : : | : | [: | |: | = | | 

Db 2 09 LPLVIVTLCYTTIIHTLTHGLQTDSCL- -KQKARRLTILLLLAFYVCFLPFHILRVIRIE 266 

Qv 2 54 SRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFR 306 

III I h = I I = = 1111 11= I = I = = h = = = =■ I I 

Db 2 67 SRLLS - ~ I SCS IENQIHEAYI VSRPLAALNTFGNLLLYWVSDNFQQAVCSTVR 318 



RESULT 3 . . l ' 

057466 • 

ID 057466 PRELIMINARY; PRT; 374 AA . 

AC 057466; • • 

DT 0I-JUM--1998 (TrEMBLrel. 06, Created) 

DT 01-JUN-1998 (TrEMBLrel. 06, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

R-E G protein coupled P2Y nucleotide receptor. 

GN TP2Y. , 

OS Meleagris gallopavo (Common turkey) . ' . 

OC Hukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Meleagrididae ; Meleagris. 

OX NCBI_TaxID=9103 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

P.C TISSUE-Blood; 

RX MEDLINE=98086419; PubMed=9415702 ; 

RA Boyer J.L., Waldo G.L.\ Harden T.K.; 

RT "Molecular cloning and expression of a.n avian G protein-coupled P2Y 

RT receptor."; 

RL Mol. Pharmacol. 52:928-934(1997). 

DR EMBL; AF031897; AAC60339.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

3Q SEQUENCE 374 AA; 42594 MW; 849C465722BDD02B CRC64 ; 

Query Match 25.6%; Score 422.5; DB 13; Length 374; 

Best Local Similarity 33.7%; Pred. No. 3.6e-28; 

Matches 10 6; Conservative 59; Mismatches 119; Indels 31; Gaps 8; 



Qy 10 WL ATE A- - ILNKYY LS AFYAI EF I FGLLGNVT WFGYLFCMKNWNS SNVY 57 

M Ml : | : : ' | I I h M I I = I I = = I 

Db 2 0 WLGGNTTAAAEAKC VFNEE FKF I LL P I S YG I VFWGL PLNS WAMW I F VS RMRP WNATTT Y -7 9 

Qy 5.8 LFNLS I SDFAFLCTLP I L I KS YAN - DKGTYGDVLC I SNRYVLHTNLYTS I LFLTF I SMDR 116 

:|||:MI =: * I I h 11= = =111 h= = MhllliM M : I 
Db 80 MFNLAISDTLYVFSLPTLVYYYADRNNWPFGKVFCKIVRFLFYANLYSSILFLTCISVHR 139 

Qy 117 YLLMKYPFREHFLQKKE FAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGN 176 

h : :| I I = I I I = II = I h = = M h : h Ml 

Db 140 YMG I CHP IRSLKWVKTKHARL I C VGWL VVT I CL I PML I FVTTS S KDNS TLCHDTTKPEE 199 

Qy 177 PEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQ 227 

:| : M : II II hill : I I I 1 = 

Db 200 FDHYVHYSSSIMALLFGIPFLVIVVCYCLMAKRLCKRS- FPSPSPRVPSYKKRSI 253 

Qy 228 RIjWLAWIFSILFTPYHIMRNLRIASRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAIN 286 

::::: ::|:|||:|||| II = III I -11111 * M : : 
Db 254 KMIIIVLTVFAICFVPFHITRTLYYTSRY- -FQADCQTLNITNFTYKITRPLASINSCLD 311 

Qv 2 87 PT F YFLMGDH YREML 3 01 

II Ih II II I 

Db "12 PILYFMAGDKYRGRL 326 : 

RESULT 9 " 
Q8VHP3 

ID QoVHP3 PRELIMINARY; PRT ; 361 AA. 

AC Q 8 VHP 3 ; 

DT 01-MAR-2002- (TrEMBLrel , 20, Created) 

DT 0 1 -MAR- 2 002 (TrEMBLrel. 20, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE CC-chemokxne receptor 4. 

OS. Cavia porcellus (Guinea pig) . 

GC ..Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Jopling L.A., Sabroe I., Andrew D.P., Mitchell T.J. , Li Y., 

11A Hodge M.R., Williams T.J., Pease J.E.; 

RT "The identification, cnaracterization and distribution of guinea pig 

fcT CCR4 and epitope mapping of a blocking antibody."; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF431971; AAL57488.1; -. 

DR InterPro; IPR000276; GPCR_Rhodps.n . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROS ITE ; PS 0 02 37; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262-; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 361 AA; 41064 MW; 93 04E8 97D4FD68 3 9 CRC64 ; 

Query Match 21.4%; Score 353; DB 11; Length 261; 

Best Local Similarity 30.0%; Pred. No. 3e-22; 

Matches 86; Conservative 62; Mismatches 131; Indels 6; Gaps -5 



Qy 21 YLSAFYAIEFIFGLLGNVTVVFGYLFCMKNWNS - SNVYLFNLSISDFAFLCTLPILIKSY 79 

= I h: I = I I I I I I I! Ill I -III Ihlll h = I I I 

Db 42 FLPPLYSLVFLFGLLGNSVWL-VLFKYKRLRSMTDVYLLNLAISDLLFVLSLPFW-GYY 99 

Qy 80 ANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 13 9 

I h :| :| - h I h 'Mill : = == = := | 

Db 100 AADQWVFGLGVCKMISWIYLVGFYSGIFFIVLMSIDRYLAIVHGVFSMRVRTFTYGVITS 159 

Qy 14 0 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

|| ||: || I I : -::| h = == II = =11 =111 =1 

Db 160 LATWAVAVFASLPGLLFSTCYTERNHTSCKTRYSANSTTWKVLSSLEINILGLVIPLGIM 219 

Qy 2 00 CFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWI F3 ILFTPYHIMRNLRI ASRLDSW 259 

|||:: hi : =| : = = I hi =111 = 1= I 1 = 

Db 220 -LFCYSMII RTLQHCKSKKKNKAVKMIFAVWLFLGFWTPYNIVLFLYTLVELEVL 274 

Qy 260 PQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFR 306 



Db 2 75 QDCSLEKYLDFALQATETLAFIHCCLNPIIYFFLGEKFRKYIVQLFK 321 

RESULT 10 
Q8MJW8 

ID Q8MJW3 PRELIMINARY; PRT; 360 AA. 

AC Q8MJW8-; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) ■ 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE CC chemokine receptor 4 . 

GN CCR4 . 

OS Canis f ami liar is (Dog) . 

OC Kukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Maeda S., Okayama T . , Masuda K. , Ohno K. , Tsu j imoto H.; 

RT "Detection of CC chemokine receptor 4 (CCR4) mRWA expression in canine 

RT atopic skin lesion."; 

RL Submitted (FEB-2002) to the EMBL / GenBank/DDB J databases. 

DR EMBL; A3080188; BAC10546.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PRO 0237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS5 0 2 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor . 

SQ SEQUENCE 360 AA; 41354 MW; 69115F5209EC0908 CRC64 ; 

Query Match 20.8%; Score 343; DB 6; Length 360; 

Best Local Similarity 29.3%; Pred. No. 2.2e-21; 

Matches 98; Conservative 59; Mismatches 137; Indels 40; Gaps. 10; 

Oy 14 EAILNKYYL SAF YAI EF I FGLLGNVT WFG YLFCMKNWN 52 

| : ! | Ml II h : h I I I I I I '-II III 

Db 13 ESIYNNYYLYENIPKPCTKEGIKAFGELFLPPLYSLVFLFGLLGN-SWVWLFKYKRIik 71 



Qy 53 S - SNVYLFNLS I SDFAFLCTLP I L I KS YANDKGT YGDVLC I SNRYVLHTNLYTS I LFLTF 111 

I -Ml ihlll h =11 M h =1 M == h I 1 = 

Db 72 SMTDVYLLNLAISDLLFVLSLPFW-GYYAADQWVFGLGLCKIISWMYLVGFYSGIFFIML 130 

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISLAWALVTLEVLPMLTFINSVPKEEGSNCIDY 171 

:|:|||| : : = : == III h = I MM : .= I 

Db 131 MSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVLASLPGLLFSTCYTERNHTYCKTK 190 

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLW 231 

I : - II : = I I =111 INI- hi I hi 

Db 191 YSRNSTRWKVLSSLEINILGLVIPLGTM-LFCYSMII RTLQHC KNEKKS KAVRMVF 245 

Qy 232 LAWIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 2 90 

| :| : lh- I h III-: I Mh = Hh I 

Db 246 AWALFLGFWAPYNVVLFLETLVELEVL-QDCTFERHLDYAIQATETLAFVHCCLNPVIY 3 04 

Qy 2 91 FLMGDH YREML I S KFR QYFKSLTSF 315 

| =|= :|= h h M = l = 

Db 3 05 FFLGEKFRKYLVQLFKTCRGPFMLCQYCRLLQMY 33 8 



RESULT 11 
Q9ERK9 

ID Q9ERKS PRELIMINARY; PRT; 32 8 .AA . 

AC Q9ERK9; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE P2Y6 receptor (Hypothetical 36.7 kDa protein). 

08 Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebra.ta; Euteleostomi ; 

00 Mammalia; Eutheria; Rodent ia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvEv; 

RX MEDLINE =21160052; PubMed= 11259526; 

RA Lazarowski E.R., Rochelle L.G., O'Neal W.K., Ribeiro CM. P., 

RA Grubb B.R., Zhang V., Harden T.K., Boucher R.C.; 

RT "Cloning and functional characterization of two murine uridine 

RT nucleotide receptors reveal a potential target for correcting ion 

RT transport deficiency in cystic fibrosis gallbladder ." ; 

RL J . Pharmacol . Exp. Ther . 297:43-49(2001). 

RN [2] 

RP SEQUENCE FROM' N.A. 

RA Strausberg R.; 

RL Submitted (APR-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF298899; AAG24619.1; -. 

DR EMBL; BC027331; AAH27331.1; -. 

DR InterPro; IPR0 0 02 7 6; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS50262; G__PR0TEIN_RECEP_F1_2 ; 1. 

KW Hypothetical protein; Receptor. 

SQ SEQUENCE 328 AA; 36721 MW; OOF 9DF5 ADADF 9 0 3 E CRC64 ; 



Query Match 20.5%; Score 338; DB 11; Length 328; 

Best Local Similarity 29.6%; Pred. No. 5.3e-21; 



Matches 84; Conservative 54; Mismatches 136; Indels 10; Gaps 4 

Qv 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLS1SDFAFLCTLPILIKSYA- 80 

h |:= = II h h = I I I I I = = = I = I = I I = I I = I I 

Db 2 9 LTPVYSWLWGLPLNICVIAQICA3RRTLTRSAVYTLNLALADLMYACSLPLLIYNYAR 8 8 . 

Qv 81 NDKGTYGD VLC I SMRYVLHTNL YTS I LFLTF I SMDRYLLMKYPFRE - HFLQKKEFAI LI S 139 

I =||: I | = : : lh llllll II IN = =1 I = I = = 

Db 8 9 GDHWPFGDLACRFVRFLFYANLHGSILFLTCISFQRYLGICHPLASWHKRGGRRAAWWC 14 8 

Qy 140 LAVWALVTLEVLPMLTFIMSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

|| || : || I : : = I I : I I = 11= = I I hi = 

Db 14 9 GWWIiAVTAQCLPTAVFAATGIQRMRTVCYDLSPPILSTRYLPYGMALTVIGFLLPFIAL 2 08 

Qy 200 CFFYYKMWFLKRRSQQQATALPL DKPQRLWLAWIFS ILFTPYHIMRNLRIAS 254 

| :| :| :| I h I h h =1 = 1 I hll = =1 

Db 209 LACYCRMA RRLCRQDGPAGPVAQERRSKAARMAVWAAVFAISFLPFHITKTAYLAV 265 

Qy 255 RLDSWPQGCTQKAIKSIYTLTRPLAFLMSAINPIFYFLMGDHYR 2 98 

I : : I III I :|| -II == =1 

Db 266 RSTPGVSCPVLETFAAAYKGTRPFASVNSVLDPILFYFTQQKFR 3 09 

RESULT 12 
Q9MYJ8 

ID Q9MYJH PRELIMINARY; PRT; 355 AA. 

AC 09MYJ8; 

DT 01-OCT-2000 (TrEMBLrel . 15,: Created) 

DT ' 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23 , Last annotation update) 

-DE Chemokine receptor. . •' 

GN CCR1 . 

OS Callithrix jacchus (Common marmoset) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Platyrrhini; Callitrichidae ; Callithrix. 

OX NCBI_TaxID=9483 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20153429; PubMed= 1 06 8 62 94 ; 

RA Liang M. , Rosser M.P., Ng H.P-, May K. , Bauman J.G. , Islam I., 

RA Ghannam A... Kretschmer P.J., Pu H. , Dunning L. # Snider R.M., 

RA Morrissey M.M., Hesselgesser J., Perez H.D., Horuk R. ; 

RT "Species selectivity of a small molecule antagonist for the CCR1 

RT chemokine receptor . " ; 

RL Eur. J. Pharmacol. 389:41-49(2000). 

DR EMBL; AF127528; AAF36453.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm__l;. 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS0 023 7; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

3Q SEQUENCE 355 AA; 40928 MW; 2B01C4 7E9874A2C1 CRC64 ; 



Query Match 



20.3%; Score 335; DB 6; Length 355; 



Best Local Similarity 2 8.1%; Pred. No. le-2 0; 

Matches 85; Conservative 63; Mismatches 13 8; Indels 16; Gaps 7 



Qy 12 ATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWMSSNVYLFNLSISDFAFLCT 71 

li I I = : h 11-11= || : : | = = = || Ml || I 

Db 27 ANERAFGAKLLPPLYSLVFVIGIjVGNILWWLVQYKRLKNMTSIYLLNLAISDLLFLFT 86 

Qy 72 LP I L I KS YANDKGTYGDVLC - - I SNRYVLHTNL YTS ILFLTF I SMDRYLLMKYPFREHFL 12 9 

II I = h =1 =1 I =1 11= I h = = = 1111 = = 

Db 87 LPFWISYQLKTDWVFGNAMCKVLSGFY- - YTGLYSEIFFI ILLTIDRYLAIVHAVFALRA 144 

Qy 130 QKKEFAI L I SLAV WALVTLEVLPMLTF INS VPKEEGSNC - IDYAS SGNPEHNL I YS LCLT 188 

= | :: |: :| | | || | | : : | : : I I Ml 

Db 145 RTVTFGVITS I I IWVLAILASLPGLYFAKTQWEITHRTCSLHFPHESRQEWKLFQALKLN 204 

Qv 139 LLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMR 248 

Ml ::j| || | :: | || :: : | | | : : : : | | : : | | | : : 

Db 205 LLGLVLPLLVM I VCYTG I I KILLRRPNEKKS KAVRLIFVIMI IFFLFWTPYNLTT 259 

Qy 249 NLRIASRLDSWPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQ 307 

: || | : : = I = I : = = I I = I = I = = I = I II 

Db 2 60 LISVFQDF-LFTYGCEQGRQLDLAIQVTEMIAYTHCCVNPVIYAFVGERFRKHL RQ 314 

Ov 308 YF 309 

I 

Db 315 LF 316 



RESULT 13 • m 

Q3JLY8 * 

ID Q9JLY3 PRELIMINARY; PRT; 355" AA. 

AC 09JLY3 ; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Macrophage inflammatory protein- 1 alpha receptor. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI__TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CD ; 

RX MEDLINE=20555330; PubMed=11091494 ; 

RP. Waller A., Nayee P., Czaplewski L.G.- 

RT "Identification and characterization of a rat macrophage inflammatory 

RT protein- 1 alpha receptor. "; 

RL J. Hematother. Stem Cell Res. 9:703-710(2000). 

DR EMBL; AF119381; AAF34340.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR ' Sfam; PF00001; 7tm_l ; 1. 

DR PRINTS; PRO 023 7; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ ' SEQUENCE 355 AA; 40838 MW; 2FEB866 1D1E6E075 CRC64 ; 



Query Match 20.2%; Score 333; DB 11; Length 355; 

Best Local Similarity 28.6%; Pred . No. 1.5e-20; 

Matches 85; Conservative 69; Mismatches 119; Indels 24; Gaps. 9 

Qy 22 LSAFYAIEFIFGLLGOTTWFGYLFCMKNWNSSNVYLFNLS1SDFAFLCTLPILIKSYAN 91 

i h || h : I I : |: = = = || ||| I 

Db 37 LPPLYSFVFIIGWGNILVILVLMQHRRLQSMTSIYLFNLAVSDLVFLFTLPFWIDYKLK 96 

Qy 82 DKGTYGDVLC- - ISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 13 9 

I :|| :| . :| I i I I = I I : = = = I I I I : = = I 1= I 

Db 97 DNWVFGDAMCKLLSGFYYL- -GLYSEIFFIILLTIDRYLAIVHAVFSLRARTVTFGIITS 154 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNL IYSLCLTLLGFL1 194 

— III I =111 : : I I J"= = i =l I ill = = 

Db 155 II1WALA1LAS1PALCFFKAQWEFTHHTC SPHFPDESLKTWKRFQALKLNLLGLIL 210 

Qy 195 PLSVMCFFYYKMV-VFLKRR3QQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

II || | - : hi = = = = | h Ih = = | =l = llh= = - 

Db 211 PLLVMIICYAGIIRILLRRPNEKKAKAV RLIFAITLLFFLLWTPYNL- -TVFVS 262 

Qy 254 SRLD - SWPQGCTQ - KAIKS I YTLTRPLAFLMSAINPI FYFLMGDHYREMLI SKFRQY 308 

: I : |||: :| =1= = =111 I * I : =h I l = = = 

Db 263 AFQDVLFTNQCEQSKQLDLAIQVTEVIAYTHCCVNPIIYVFVGERFRKYLRQLFQRH 319 



RESULT 14 
Q8CBJ0 

ID Q3CBJ0 PRELIMINARY; PRT; 354 AA. 

AC QSCBJO; • 

DT 01-MAR-20G3 (TrEMBLrel , 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) ' 

DT 01 MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Chemokine . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=C57BL/6J; TISSUE-Cerebellum; 

RX MEDLINE-22354683; PubMed=12466851 ; 

RA The FANTOM Consortium, 

RA rhe RIKEN Genome Exploration Research Group Phase I & II Team; 

RT 51 Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK035902; BAC29236.1; -. 

SQ SEQUENCE 354 AA; 40232 MW; F2 2 94£0 8 0D7 5BCA3 CRC64 ; 

Query Match 19.8%; Score 326.5; DB 11; Length 354; 

Best Local Similarity 28.2%; Pred. No. 5.5e-20; 

Matches 32; Conservative 59; Mismatches 13 3; Indels 17; Gaps 5 

Qy 21 YLS AFYAI EF I FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLP IL I KS YA 80 

:|1 Mh ! Mhlh II I : :::|| ||::|| |: ||| 

Db 34 FLSVFYALVFTFGLVGNLLWLALTNSRKPKSITDIYLLNLALSDLLFVATLPFWTHYLl 93 



Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISL 140 

: : | : : : | = I I : i I I : I I I I = = = *' III 

Db 94 SHEGLH - NAMCKLTTAF F F I GF FGG I F F I TV I S I DR YL AI VL AANS MNNRT VQHGVT I S L 152 

Qy \A 1 AVWALVTLEVLPMLTFINSVPKEEGSNCI -DYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

III I I I I = = I = II == = : : 1 1 1 : M I 

Db 153 GVWAAAILVASPQFMF TKRKDNECLGDYPEVLQEMWPVLRNSEVNILGFALPLLIM 2 08 

Qy 200 CFFYYKMWFL KRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

I |:: = = I I I : =| lh = | I I = HIMI I 

Db 209 SFCYFRIIQTLFSCKNRKKARAV RLILLWFAFFLFWTPYNIMIFLETLKLY 260 

Qy 257 DSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQ 3 07 

= =| == == = = l =11 : HI I h I I =1 = 

Db 2 61 NFFPSCDMKRDLRLALSVTETVAFSHCCLNPFIYAFAGEKFRRYLGHLYRK 311 



RESULT 15 
Q9MYJ9 

ID Q9MYJ9 PRELIMINARY; PRT; 351 AA. 

AC Q9MYJ9; 

DT 01-OCT-2000 (TrEMBLrel . 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Chemokine receptor. 

GN CCR1 . 

OS Oryctolagus cuniculus (Rabbit) . * 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolag\is . 

OX NCBI_TaxID~9986; 

PM [1] • 

RP SEQUENCE FROM N . A . 

RX MEDLINE=20153429; PubMed=10636294 ; 

RA Liang M . , Rosser M.P., Ng H.P., May K. , Bauman J.G.. , Islam I., 

RA Ghannam A., Kretschmer P.J., Pu H. , Dunning L . , Snider R.M. , 

RA Morrissey M.M., Hesselgesser J., Perez H-.D., Horuk R . ; 

RT "Species selectivity of a small molecule antagonist for the CCR1 

RT chemokine receptor . " ; 

RL Eur. J. Pharmacol. 3 8 9:41-49(2000). 

DR EMBL; AF127527; AAF36452.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS0023 7; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KVJ Receptor. 

SQ SEQUENCE 351 AA; 40534 MW; B3FED6117A14 1552 CRC64 ; 

Query Match 19.8%; Score 326; DB 6; Length 351; 

Best Local Similarity 27.8%; Pred. No. 6e-20; 

Matches 83; Conservative 65; Mismatches 137; Indels 14; Gaps 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILI KSYAIST 81 

| | = = |= I |:||| II = = : = = = || I hi II II IN 1 

Db 3 7 LPPLYSLVFVIGLVGNVLWLVLMKYKRLRSMTSIYLLNLAISDLLFLFTLPFWIDYRLK 96 



Qy 



82 DKGTYGDVLC - - 1 SNRYVLHTNLYTS I LFLTF I SMDRYLLMKYPFREHFLQKKE FAI L I S 139 



Db 97 DDWVFGDVLCKFLSGLY- - YVGLYSEVFFIILLTIDRYLAIVHAVFALRARTVSFGIVTS 154 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNC-IDYASSGNPEHNLIYSLCLTLLGFLIPLSV 198 

: III I =1 I : = I • : : =1 I =11. -II I 

Db 155 IVTWALTILAAIPGFRFSKTQWEFTHYTCSLHFPHESLRQWKQFQALKLNILGLVLPLLV .214 

Qy 199 MCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDS 258 

I | - | I I - : = | | = : = : | | : = | | | = = | = = : II 

Db 215 MWCYTGI IQILLRRPNEKKS RAVRL I FVIML I FFLFWTPYNL - - TLLVS AFQDS 267 

Qy 259 -WPQGCTQ-KAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSLTSF 315 

: |||: =| :|= = :||= I =h = = = | | I I 

Db 268 LFTNQCEQSKQLDLAIQVTEVIAYTHCCVNPVIYVFVGERFQKYLRQLFHTYLAKWLPF 326 

Search completed: December 12, 2003, 18:08:32 
Job time : 41 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



CM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



December 12, 2003, 17:58:12 ; Search time 40 Seconds 

(without alignments) 
1257.908 Million cell updates/se 

US-09-891-138A-2 
1650 

1 MAQNL S CENWL ATE AI LNKY REML I S KFRQYFKSLTS FRT 317 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters : 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1107863 
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Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 
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ALIGNMENTS 



RESULT 1 
AAU74904 

ID AAU74904 standard; Protein; 317 AA. 
XX 

AC AAU74 9 04; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Amino acid sequence of mouse G-protein coupled receptor TGR18 protein. 
XX 

KW Mouse; G-protein coupled; receptor; GPCR; TGR18 ; kidney disease; 

KW signal transduction modulator; cerebral cavernous malformation ; 

KW hyperlipidemia; obesity; dyslexia; cardiac myxoma; renal failure; 

KW nephritis; hypertension; liver disease; cirrhosis; blood disorder; 

KW spleen-associated disorder; immune disorder. 
XX 

OS Mus sp . 

XX 

PN WO200200719-A2 . 
XX 

PD 03-JAN-2002. 
XX 

PF 25-JUN-2001; 2 001WO-US20363 . 

XX 

PR 23-JUN-2000; 2000US J 213461P . 
XX 

PA (TULA- ) TULARIK IMC . 
XX 

PI Lin DC, Zhao J, Chen J, Cutler G; ' 

XX 

DR WPI; 2002-147880/19. 

"DR N-PSDB;- ABK12957. 

XX 

PT New G-protein coupled receptor polypeptides, useful for identifying 

PT modulators of signal transduction for treating kidney disease, 

PT hyperlipidemia, obesity, dyslexia and cardiac myxoma 
XX 

PS Claim 33; Page 59; 78pp; English. 

XX . 

CC The present invention relates to a new G-protein coupled receptor (GPCR) 

CC polypeptide comprising greater than 70% amino acid sequence identity to 

CC the amino acid sequence of human GPCRs TGR62, TGR21, TGR130.1, TGR130.2, 

CC human TGR213 or TGR92 , 80% amino acid sequence identity to mouse TGR18 

CC or 90% amino acid sequence identity to human novel edg receptor protein, 

CC as defined in the specification. The GPCR covalently linked to a solid 

CC phase is useful for identifying a compound that modulates signal 

CC transduction. The identified compounds are useful for treating 

CC kidney disease, cerebral cavernous malformations, hyperlipidemia, 

CC obesity, dyslexia and cardiac myxoma. The molecules of the invention are 

CC useful for diagnosing disorders or. conditions such as kidney-related 

CC conditions or diseases such as renal failure, nephritis, nephrotic 

CC syndrome, asymptomatic urinary abnormalities, renal tubule defects, 

CC hypertension and nephrolithiasis, liver-related disease or condition 

CC e.g. cirrhosis, infiltrations, lesions, functional disorders and jaundice 

CC and spleen-associated disorders or conditions e.g. splenic enlargement, 

CC immune disorders, blood disorders and others. Modulation of the 

CC polypeptide of the invention is useful to treat or prevent any of the 



CC above conditions or diseases. The present amino acid sequence represents 

CC the mouse GPCR TGR18 protein of the invention. This sequence is one of 

CC seven novel G protein coupled receptors of the invention (AAU74904- 

CC AAU74 911) . 
XX 

SQ Sequence 317 AA; 

Query Match 100.0%; Score 1650; DB 23; Length 317; 

Best Local Similarity 100.0%; Pred. No. 1.3e-157; 

Matches 317; Conservative 0; Mismatches 0; Indels 0; Gaps 0 
Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

IIIIIIIMMIIiMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIMMIII 

Db 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

IMIIMIIIIIIIIIIIIilllhlllllMlllillMIIIMMillllllllllM 

Db 61 LSI SDFAFLCTLP IL I KSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTF I SMDRYLLM 120 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

IlilllllllllllllllllMlllllllllliMllilllllllllllMIIIMilll 

Db 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

Qy 131 -LIYSLCLTLLGFLrPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAVVIFSIL 240 

MIIIMIIIIIMIIIillllllllllllllMIIIIIIIIIIIIMIIIIMlliiM 

Db 131 LIY3LCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

0y "41 FTPYHIMRNLRIA3RLDSWPQGCTQ1CAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM .3 00 

MllllilMllllllMMMMIIIIMIIMIMMIilillMMilMIIIMM 

Dh 241 FTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLN$AINPIFYFI,MGPHYREM 30.0 

Ov 301 LIS KFRQYFKSLTS FRT 317 

1 1 M F 1 1 M i I f I ! 1 1 1 

Db 301 LISKFRQYFKSLTSFRT 317 



RESULT 2 
A3P81696 

ID ABP81696 standard; Protein,; 330 AA. 
XX 

AC ABP81696; 
XX 

■D.T 04-MAR-2003 (first entry) 

XX 

DE Human purinergic receptor P2U2 protein SEQ ID NO: 567. 
XX 

KW G protein- coupled receptor; GPCR ; antigenic peptide; gene therapy; 

KW G protein- coupled receptor modulator; antibody; immune -related disease; 

KW growth-related disease; cell regeneration-related disease; AIDS; cancer ; 

XW immunological -related cell proliferative disease; autoimmune disease ; 

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

XW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 

KW graft versus host disease; Parkinson' s disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea ; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 

KW ulcer. 



XX 

OS Homo sapiens . 
XX 

PN WO200261087-A2 . 
XX 

PD 08-AQG-2002 . 
XX 

PF 19-DEC-2001; 2001WO-US50107 . 
XX 

PR 19-DEC-2000; 2 0 OOUS -2 57144P . 
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR N-PSDB; ABZ42542. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR) , useful for diagnosing and designing drugs for treating 

PT conditions in which GPCRs are involved, e.g. AIDS , Alzheimer's disease, 

PT cancer or autoimmune diseases - 
XX 

PS Disclosure; Fig 1; 52 3pp; English. 
XX 

CO The present invention describes antigenic peptides. (I) comprising: 

CC (a) any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 

CC acids. .Also describee!: (1) an assay for the detection of a particular. 

CC G protein- coupled receptor (GPCR) or a candidate polypeptide in a sample,- 

CC .and (2) an isolated antibody having high specificity and high affiivity 

CC or; avidity for a particular GPCR. (I) can be used as GPCR modulators and 

CC in gene therapy. The antigenic- peptides for GPCRs are useful in detecting 

CC an antibody against a particular GPCR, and in the production of specific : 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides ,for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune -related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological -related cell proliferative 

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, ■ 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodiagnosis . ABZ42523 to ABZ42869 encode 

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present- invention. 
XX 

SQ Sequence 330 M; 

Query Match 75.1%; Score 1238.5; DB 24; Length 330; 
Best'Local Similarity 72.2%; Pred. No. 3.7e-116; 

Matches . 22 8; Conservative 42; Mismatches 45; Indels 1; Gaps 1 



Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

M I :|:||M II 1 IIMI II II h hill 11 = 11 = 1 Mlllllhllll 
Db 1 MAWNATCKNWL AAE AALE KY YL S I F YG I E F WGVLGNT I WYG Y I F S LKNWNS SN I YLFN 60 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 120 

I I : I I II I I I I I : I I : I I I I I I I I I I I I I I II I I I I I I I II I I I I I I = I I I I : = 

Db 61 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 120 

Qy 121 KYPFREHFLQKKEFAI L I SLAWALVTLEVLPMLTF INS VPKEEGSNC IDYAS SGNPEHN 180 

IMM MINI 1 1 MM hi Mill H hi III = h I hi 1 1 hi =1 

Db 121 KYPFREHLLQKKEFAI L I SLAI WVLVTLELLP I LPLINPVI TDNGTTCNDFAS SGDPNYN 180 

Qy 1ST LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 24 0 

MM Mill Mil lllllllh HUM- I IMIIhll li-IIIMhl 

Db 131 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 240 

Qy 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

M I I h 1 1 h I I II 1 1 M I III I I I MM II I II I llhlllhllhh 

Db * 241 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 00 

Qy 300 MLISKFRQYFKSLTSF 315 

II = = = I Mill I I 

Db 3 01 MLMNQLRHMFKSLTSF 316 

RESULT 3 
AAY713 08 

ID AAY7I30& standard; Protein; 334 AA. - 

XX 

AC .AAV713 08; ■ ■ 

XX 

DT 02-NOV-2000 (first entry) 

XX 

DE Human orphan G protein-coupled receptor hCHNIO . 
XX 

KW Human'; orphan G protein-coupled receptor; GPCR; hCHMlO; drug screening; 
KW transmembrane receptor; expressed sequence tag; EST; signal cascade. 
XX 

OS Homo sapiens . 

XX 

PN V7O200031258-A2 . 
XX 



PD 




■JUN- 


2000 






XX 












PF 


j. 3 - 


QCT- 


1999 


9.9WO- 


US23687 


XX 












PR 


' 20- 


NOV- 


1998 


98US- 


0109213 


PR 


16- 


FEB- 


1999 


99US- 


0120416 


PR 


26- 


FEB- 


1999 


99US- 


0121852 


PR 


12 - 


MAR- 


1999 


99US- 


0123946 


PR 


i 2 - 


MAR- 


1999 


99US- 


0123949 


PR 




MAY- 


1999 


99US- 


0136436 


PR 




may- 


1999 


99US- 


0136437 


PR 


2 8^- 


MAY- 


1999 


99US- 


0136439 


PR 


2P>- 


MAY- 


1999 


99US- 


0136567 


PR 


2 8 - 


MAY- 


1999 


99US- 


0137127 


PR 


28- 


MAY- 


1999 


99US- 


0137131 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
?T 
XX 
PS 
XX 
CC 



r'f 1 
CC 

CC 
CC 
CC 
CC 
CC 
CC 
XX 



29 


- JUN- 


1999 , 


99US- 


014144 8 


29 


-SEP- 


1999 , 


9 9 US - 


0156555 


2 9 


--SEP- 


1999 , 


99US- 


015 663 3 


29 


-SEP- 


1999, 


9 9US- 


0156634 


29 


-SEP- 


1999, 


9 9US-- 


0156653 


01 


-OCT- 


1999, 


99US- 


015 72 80 


01 


-OCT- 


1999 


99US- 


0157281 


01 


-OCT- 


1999 


99US- 


0157282 


01 


-OCT- 


1999 


99US- 


0157293 


01 


-OCT- 


1999 


99US- 


0157294 


12 


-OCT- 


1999 


9 9US- 


0416760 


12 


-OCT- 


1999 


99US- 


0417044 


(AREN-) 


ARENA PHARM 


INC. 



Chen R, Dang HT, Liaw CW, Lin I; 

WPI; 2000-400068/34. 
N-PSDB; AAD01135. 

Novel human orphan G protein-coupled receptors and the encoding cDNAs 
for use in the identification of G protein-coupled receptor agonists - 

Claim 70; Page 37-88; 102pp; English. 

The present amino acid sequence is the hCHNIO, an endogenous human ■ 
orphan G protein-coupled receptor (GPCR) , expressed in kidney 'and 
thyroid. The hCHNIO cDNA .was 1 identified using the human EST (expressed 
sequence tag) 1365839 as a probe . The orphan GPCR of the invention/ like 
all GPCRs has- seven transmembrane' alpha helices with an extracellular 
1J- terminus and an intracellular C-term.inus . However, no endogenous 
ligands has yet been identified for the proteins of the invention. The 
orphan GPCRs may be used in the identification of their endogenous 
ligands, and to screen potential GPCR agonists and antagonists' 'for use as 
pharmaceutical agents. The proteins may also be used in the study of 
GPCR-mediated signalling cascades, and to elucidate their precise role in 
normal and diseased human conditions. Nucleic acid encoding human orphan 
GPCRs may be used for tissue localisation expression analysis to provide 
information about their function in healthy and pathological states. 

Sequence 334 AA; 



Query Match 75.1%; Score 1238.5; DB 21; Length 334; 

Sest Local Similarity 72.2%; Pred. No. 3.8e-116; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 



Qy 

Db 



1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

I I I MINI II I I I I I I II I I h hill Ihlhl Uiiilhllll 
5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 



n 1 LSI SDFAFLCTLPI L I KSYAND KGTYGDVLC I SNR YVLHTNLYTS I LFLTF I SMDR YLLM 12 0 

I Ml MIIIIMUIIII MINIUM 1 1 1 II I II 1 1 1 1 1 M I M 

65 LS VSDLAFLCTLPML I RS YANGNW I YGDVLC I SNR YVLHANLYTS ILFLTF I S I DR YL I I 124 



Db 



121 KYPFREHFLQKKEFAXLISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

• ! 1 1 1 1 1 1 lUlMIIIIM IIIIMM II I M: I MUM M 

123 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 



Qy 

Db 

Qy 

Db 

Qy 

Db 



131 LIYSLCLTLLGFLIPLSVMCFFYY.KMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

Mlhlllllllllll IIIIMIh MTM:: IMIIhli IhMMMhl 

185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 
241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

IMIhllhllllM II I III I I I MIMIMM Mhlllhllhh 

245 FTPYHVMRNVRIASRLGSWKQYQCTQWIMSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 

300 MLISKFRQYFKSLTSF 315 

Ih:: | IMIIM 
3 05 MLMNQLRHNFKSLTSF 32 0 



AAB02842; 

22-AUG-2000 (first entry) 

Human G protein coupled receptor hCHNIO protein SEQ ID NO: 38. 

Human; G protein coupled receptor; GPCR; transmembrane receptor; 
identification; agonist; screening; : therapeutic ; pharmaceutical; 
mutant . .. . 

Homo sapiens . :■ 

■WG200022131-A2 . • 



RESULT 4 
AAB02842 

ID AAB02842 standard; Protein; 334 AA. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
'XX 
PN 
XX 
<PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 



20-APR-2000 . 

13-OCT-1999; 

13-OCT-1998 
*L2 -NOV- 1998 
20-NOV-1998 

27 - NOV- 1998 
16-FEB-1999 

26- FEB-1999 
12-MAR-1999 
12-MAR-1999 
12-MAR-1999 
12-MAR-1999 
12-MAR-1999 
12-MAR-1999 

28- E4AY-1999 
28-MAY-1999 
28-MAY-1999 
28-MAY-1999 
28-MAY-1999 
28-MAY-1999 
30-JUN-1999 

27- AUG-1999 
03-SEP-1999 



99WO-US24 065 , 

98US-0170496. 
98US-0108029. 
98US-0109213 . 
98US-0110060 . 
99US-0120416. 
99US-0121852 . 
99US-0123944 . 
99US-0123945. 
99US-0123946. 
99US-0123948 . 
99US-0123949. 
99US-0123951. 
99US-0136436. 
99US-0136437 . 
99US-0136439. 
99US-0137127 . 
99US-0137131 . 
99US-0137567 . 
99US-0141448 . 
99US-0151114 . 
99US-0152524 . 



99US-0156633 . 
99US-0156555 . 
99US-0156634 . 



PR 29-SEP-1999; 

PR 29-SEP-1999; 

PR 29-SEP-1999; 
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Behan DP, Lehmann-Bruinsma K, Chalmers DT, Chen R, Dang HT; 

PI Gere M, Liaw CW, Lin I, Lowitz K, White C; 

XX 

DR WPI; 2000-317986/27. 

DR N-PSDB; AAA46036. 
XX 

PT Non- endogenous , human G protein-coupled receptors for screening 

PT receptor, inverse or partial agonists useful as therapeutic agents 

XX 

PS Example 1; Page 117-118; 187pp; English. 
XX 

CC The present invention describes transmembrane receptors, preferably 

CC human G protein coupled receptors (GPCR) , for which the endogenous 

CC ligand is unknown (orphan GPCR receptors) . More specifically the present 

CC invention relates to non - endogenous , constitutively activated versions 

CC of a human GPCR. These non- endogenous human GPCRs can be useful for 

CC the direct identification of candidate compounds as receptors agonists, 

CC inverse agonists or partial agonists for use as pharmaceutical agents. 

CC AAA46017 to AAA46126 and AAB02825 to AAB02859 represent sequences used in 

CC the exemplification of the present invention. • . 

XX 

3Q Sequence 3 34 AA; 

Query Match 75.1%; Score 1238.5; DB 21; Length 334; 

Best Local Similarity 72.2%; Pred. No. 3.8e-116; 

Matches 22 8; Consei;vative 4 2; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKJSIWNSSNVYLFN 60 

I! I : hi II I II I Mill M II h hill I hi hi : I II II I hi I II 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 51 LS I SDFAFLCTLP I L I KS YANDKGT YGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLM 120 

Ihll llllllhlhllll IIIIIIIIIMIII IIIIMIIIIIIhlllh: 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKE FAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180. 

Illllll MM MIMI II hi II Mhlh I III = h I hll I hi =1 

Db 125 KYPFRKHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 101 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

MM hi Ml III I Ml Mlllllh MlhhM lllllhll IhMIIMhl 

Db 18 5 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 24-1 FTPYHIMRNLRIASRLDSWPQ-GCTOKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 299 

llllhllhllllll II I III I I I =1111 MM I MhlMhllhh 

Db 245 FTPYHVMRNYRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 3 04 

Qy 3 00 ML I S KFRQYFKSLTS F 315 

I h : = I M II II I 
Db 3 05 MLMNQLRHNFKSLTSF 320 



RESULT 5 
ABB90381 

ID ABB90381 standard; Protein; 334 AA. 
XX 

AC ABB90381; 
XX 

DT 24-MAY-2002 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2757. 
XX 

KW Cytostatic; immunosuppressive; nootropic; neuroprotective; antiviral; 

KW antiallergic; hepatotropic ; antidiabetic; antiinflammatory; antiulcer; 

KW vulnerary; anticonvulsant; antibacterial; antifungal; antiparasitic ,- 

KW cardiant; gene therapy; cancer; immune disorder; cardiovascular disorde 

KW neurological disease; infection; human; secreted protein. 
XX 

OS Homo sapiens . 
XX 

PN . WO200190304-A2 . 

XX 

PD 29-NOV-2001. 
XX 

PF 18-MAY-2Q01; 2001WO-US16450 . 

XX . 

PR 19X*iAY-2 0 00; 2000US-2 05515P . ; : X 

XX 

?A (HTTM^k- ) HUMAN GENOME SCI INC. > 

xx : ■ 

PI Birse CE , Rosen CA; 

:>x • •• 

DP, WPT; 2002-122018/16 . ' 

7M N-PSDB; ABL90790 . 

XX 

PT Novel 14 05 isolated polypeptides, useful for diagnosis, treatment and 

PT prevention of neural, immune system, muscular, reproductive, 

PT gastrointestinal, pulmonary, cardiovascular, renal and proliferative 

PT disorders - 

XX 

PS Claim 11; SEQ ID NO 2757; 2081pp + Sequence Listing; English. - 
XX 

CC The invention relates to novel genes (ABL8 944 9-ABL90853) and proteins 

CC (ABB89040-ABB90444) useful for preventing, treating or ameliorating 

CC medical conditions e.g. by protein or gene therapy. The genes -are 

CC isolated from a range of human tissues disclosed in the specification. 

CC The nucleic acids, proteins, antibodies and (ant ) agonists are useful 

CC in the diagnosis, treatment and prevention of: (a) cancer, e.g. breast 

CC and ovarian cancer and other cancers of the adrenal gland, bone, bone 

CC marrow, breast, gastrointestinal tract, liver, lung, or urogenital; 

CC (b) immune disorders e.g. Addison's disease, allergies, autoimmune 

CC haemolytic anaemia . . autoimmune thyroiditis, diabetes mellitus, Crohn's 

CC disease, multiple sclerosis., rheumatoid arthritis and ulcerative 

CC colitis; (c) cardiovascular disorders such as myocardial ipchaemias; 

CC (d) wound healing; (e) neurological diseases e.g. cerebral anoxia and 

CC epilepsy; and (f) infectious diseases such as viral, bacterial, fungal 

CC and' parasitic infections. 

CC Note: The sequence data for this patent did not lorm part of the 



CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 334 AA; 

Query Match 7 5.1%; Score 123 8.5; DB 23; Length 334; 

Best Local Similarity 72.2%; Pred . Mo. 3.8e-116; 

Matches 22 8; Conservative 42; Mismatches 45; Indels 1; Gaps 1 
Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I : I : M 1 1 II I MM! II 111= hill 1 1 = 11: 1 MIIMIhllil 

Db . 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 64 

Qy 61 LSI SDFAFLCTLP ILI KS YANDKGTYGDVLC I SNRYVLHTNL YTS I LFLTF I SMDRYLLM 120 

i hi I M 1 1 1 1 = 1 1 : M I ! II III MUM Ml I M M II I M M h 1 1 1 1 - 

Db 65 L S VSDLAFLCTLPMLI RS YANGNWI YGDVLC I SNRYVLHANL YTS I LFLTF I SIDRYL 1 1 124 

Qv 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFIMSVPKEEGSNCIDYASSGNPEHN 180 

IIMIII III Mill llllhl llllhlhl II I : h I hlllhl H 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDMGTTCNDFASSGDPNYN 184 

Oy 181 LIYSLCLTLliGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

1 1 M = 1 1 1 1 1 1 1 f 1 1 f lllllllh = 1 1 ! = I - I MlllhM I h M I M I h I 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAW1FSVL 244 

Ov 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLxMGDHYRE 2 99 

MMhMhIIMM III Mi ll! I ! ! 1 1 1 1 1 ! MhMlhllhh 

Db 245 FTPYHvT^RNVRIASRLGSWKQYQCrQVVINSFYIVTRPLAFLKTSVINPVFYFL : LGDHFRD 304 

Ov 300 MLXSKFRQYFKSLTSF 315 

M— ! MIMM 

Db 3 05 MLMNQLRHNFK3LTSF 320 



RESULT 6 ' 
ABR41222 

ID ABR41222 standard; Protein; 371 AA. 

XX 

AC ABR41222 ; 

XX 

DT 02-JUN-2003 (first entry) 

XX 

DE Human DITHP. receptor . 
XX 

KW Human; dithp; diagnostic and therapeutic polynucleotide; diagnosis; 

KW cancer; cell proliferative disorder; autoimmune disorder; 

KW inflammatory disorder; infection; hormonal disorder; metabolic disorder; 

KW neurological disorder; gastrointestinal disorder; transport disorder; 

KW connective tissue disorder; drug screening; proteome analysis;- 

KW gene therapy; antisense therapy; genotyping; transgenic animal; knock in; 

KW disease model; toxicological testing; transcript imaging; 

KW receptor. 

xx : 

OS Homo sapiens. 

XX . 

FN WO 2 0 02 97031 - A2 . 

XX 



PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
PI 
PI 
PI 
XX 
DR 
DR 
XX 
F-T 

r ; T 
.:-t 

PT 
XX 
PS 
V "X 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc' 



05 


-DEC - 


2 002 




2 7 


-MAR- 


2 002 


Z UUzWU 


28 


-MAR- 


2001 


2 001US 


29 


-MAR- 


2001 


2001US 


2 9 


- MAR - 


2001 


2001US 


16 


-MAY- 


2001 


2001US 


17 


-MAY- 


2001 


- 2001US 


17 


-MAY- 


2001 


• 2001US 


19 


-JUN- 


2001 


• 2001US 


20 


- JUN- 


2001 


• 2001US 


20 


-JUN- 


2001 


• 2001US 



-279619P. 
-280067P. 
-280068P. 
-291280P. 
-291829P. 
-291849P. 
-299428P. 
-299776P. 
-300001P. 



(INCY-) INCYTE GENOMICS INC. 



Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chi nil J; 
Dufour GE., Hillman JL, Yu JY, Tuason 0, Yap PE, Amshey SR; 
Daughtery SC, Dam TC, Liu TF, Nguyen DA, Kleefeld Y, Gerstin EH; 
Peralta CH, David MH, Lewis SA, Chen AJ, Panzer SR, Harris B; 
Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

WPI; 2003-129518/12. ( 

N-PSDB; ACC46165 . 1 " 

Novel human diagnostic and therapeutic polypeptide -useful for 
identifying test compound which specifically binds to a polypeptide 
Encoded by human diagnostic and therapeutic polynucleotide, and to • 
induce antibodies 

Claim 27; SEQ ID No 757; 591pp; English. 

The invention relates to novel human diagnostic and therapeutic 
polynucleotides designated dithp (ACC46080 -ACC46749) and to their 
'encoded proteins (DITHP; ABR4113 6 -ABR4 1812 ) . The invention also relates 
to polynucleotide sequences at least 90% identical to the dithp cDNA 
sequences of the invention; recombinant vectors, host cells and 
transgenic organisms comprising a dithp nucleic acid sequence;- the 
recombinant production of DITHP proteins; antibodies specific for DITHP 
proteins; microarrays comprising dithp nucleic acid sequences; methods 
of detecting dithp nucleotide and protein sequences; methods of screening 
for compounds which specifically bind a DITHP protein; and methods of 
assessing the toxicity of test compounds using a dithp hybridisation 
probe. Dithp nucleic acid sequences and DITHP proteins may be used in the 
diagnosis of a wide variety of conditions including cancer and other cell 
proliferative disorders; autoimmune or inflammatory disorders; bacterial, 
viral, fungal or parasitic infections; hormonal disorders; metabolic 
disorders; neurological disorders; gastrointestinal disorders; transport 
disorders; and connective tissue disorders. They may also be used to 
screen for modulators of protein activity or gene expression. DITHP 
proteins can additionally be used in analysis of the proteome of a tissue 
or cell type and to induce antibodies. The dithp nucleic acids are 
additionally useful in somatic or germline gene therapy of the disorders 
mentioned above, as a source of antisense sequences, as a source of 
probes and primers, in genotyping and identification of individuals, in 
the generation of transgenic animal models of human disease or knock in 
humanised animals, in toxicological testing, and in transcript imaging. 



CC The present, sequence represents a DITHP protein which has receptor 

CC activity. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published__pct_sequences. 
XX 

SQ Sequence 3 71 AA; 

Query Match 75.1%; Score 1238.5; DB 24; Length 371; 

Best Local Similarity 72.2%; Pred. No. 4.4e-116; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I = J = ||| | I! I lllll II llh hill I hi hi :| I I II I hill I 
Db 42 MAWNATCKNWLAAEAALEKYYLS I FYGIEF V^/GVLGNTI WYGYI FSLKNWNSSN1 YLFN 101 

Qy 61 LS I SDFAFLCTLP I L I KS Y ANDKGTYGDVLC I SNRYVLHTNL YTS ILFLTF I SMDRYLLM 120 

I hi. I MllllhlhlMI 1 1 1 II M I M I! 1 1 II I II II I II M h II I h = 

Db 102 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 161 

Qy 121 KYPFREHFLQKKEFAIL I S LAVWALVTLEVLPMLTF INS VPKEEGSNC ID YAS SGNPEHN 180 

MIIMI IMIMMMIIhl IIMhlhl Ml = h I 1 = 1111 = 1 M 

Db 162 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 221. 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

MlhlMMMMM 1 1 1 1 1 1 1 h HI i- l-l IMMhll I MM I Ml hi • 

nb 222 LIYSMGLTLLGFLIPLFVMCFFYYKIMjFLKQRWRQVATAIjPLEKPIjNLVIMAlWIFSVL 281 

0v 24 1 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFIjMGDHYRE 2 9.9 

NMhllhlMMI il I Ml I ! ! MIIMIMI II hi II hi I hh 

LT/ 2 32 F'rPYHWRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRP 341* 

Qy 300 ML X S KFRQYFKSLTS F 315 • 

Ih- I MIIMI 
Db 34 2 MLMNQLRHNFKSLTSF 3 57 



RESULT 7 
AAE15633 

ID AAE15633 standard; Protein; 379 AA. 
XX 

AC AAE1S633; 
XX 

DT 12-MAR-2002 (first entry) 

XX 

I>E Huttian- G- protein coupled receptor- 3 (GCREC-3). protein. • 
XX 

KW Human; G-protein coupled receptor-3; GC^EC-3; therapy; cancer; stroke; • 

KW cell proliferative disorder; neurological; epilepsy; Parkinson's disease; 

KW Alzheimer's disease; inflammation; thyroiditis; ha emo lytic anaemia; AIDS; 

KW Acquired Immune Deficiency Syndrome; dementia; nootropic; cholelithiasis; 

KW multiple sclerosis; atherosclerosis; angina pectoris; gastroenteritis ; 

KW diabetes; ulcer; viral infection; immunosuppressive. 
XX 

OS Homo sapiens. - " 

XX 

FH Key - Location/Qualif iers 



FT 
FT 
FT 
FT 
FT 



Domain 



187 . .206 

/label= Transmembrane_domain 
234 . .253 

/label= Transmembrane_domain 
276..296 

/label= Transmembrane_domain 
319. .342 

/ label = Transmembrane domain 



Domain 



Domain 



FT 



FT 
FT 



Domain 



XX 

PN WO200198351-A2 . 
XX 



PD 


27 


-DEC- 


2001 . 






XX 












PF 


15 


-JUN- 


2001; 


2001WO 


-US19275 


XX 












PR 


16 


-JUN- 


2000; 


2000US 


-212483P 


PR 


22 


-JUN- 


2000; 


2000US 


-213954P 


PR 


29 


- JUN 


2000; 


2000US 


-215209P 


PR 


07 


-JUL- 


2000; 


2000US 


-216595P 


PR 


14 


-JUL- 


2000; 


2000US 


-218936P 


PR 


19 


- JUL- 


2000; 


2000US 


-219154P 


PR 


21 


-JUL- 


2000; 


2000US 


-220141P 



XX . . 

PA (INCY-) INCYTE GENOMICS INC. 

XX . 

PI Lai P, Baughn MR, Hafalia A^JA, Nguyen DB, : Gandhi AR, Kallick DA; 

.PI Griffin JA, Yue H, Khan FA, Patterson C, Lu DAM , Tribouley CM;' 

PI Lu Y, Walia NK, Graul R, Yao MG, Yang J, Ramkumar J, Au-Ycung- J; 

PI Elliott V3, Hernandez R, Walsh RT, -Borowsky ML, 1 ' Thornton M, He A; 
XX 

DR WPI; 2002-075627/10. 

DR N-PSDB; AAD24958 . 

XX 

PT Isolated human G-protein coupled receptor polypeptides and the use of 

PT these sequences in the diagnosis, treatment and prevention of diseases 

PT and in the assessment of exogenous compounds on the expression of the 

PT receptors - 
XX 

PS Claim 1; Page 115-116; 143pp; English. 

XX 

CC The invention relates to isolated human G-protein coupled receptor 

CC (GCREC) polypeptides and their biologically active fragments. GCREC and 

CC protein is useful in treating a disease or condition associated with an 

CC increase or decrease in expression of functional GCREC. The GCREC ' s are 

CC useful in the diagnosis, treatment and prevention of cell proliferative 

CC disorders (cancer, leukaemia, melanoma) ; neurological disorders (stroke, 

CC epilepsy, Parkinson's disease, dementia, Alzheimer's disease); autoimmune 

CC inflammatory disorder (thyroiditis, haemolytic anaemia, AIDS, multiple 

CC sclerosis) ; cardiovascular disorder (atherosclerosis, angina pectoris) , 

CC gastrointestinal disorder (ulcer, cholelithiasis, gastroenteritis), 

CC metabolic disorders (diabetes) ; viral infections (herpes virus) and in 

CC the assessment of the effects of exogenous compounds on the expression 

CC of the nucleic acid and amino acid sequences. The present sequence is 

CC human GCREC- 3 protein. 

XX 

SQ Sequence 379 AA; 



Query Match 75.1%; Score 1238.5; DB 23; Length 379; 

Best Local Similarity 72.2%; Pred. No. 4.5e-H6; 

Matches 228; Conservative 42; Mismatches 45; Indels 1; Gaps 1 

Qy 1 MAQNLSCEMWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I MMIM il I Mill II 1th hill MM Mlllllhllll 

Db 50 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFN 109 

Qy 61 LSI SDFAFLCTLP I L I KS YANDKGTYGD VLC I SNRYVLHTNLYTS I LFLTF ISMDRYLLM 120 

MM llllllhlhllll IIIIMIillllll IIMIIIIIIMhIMh: 

Db 110 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 169 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

Illllll IMMMMMIhl IMIhlhl III MM hlllhl 

Db 170 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 22 9 

Qy 131 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

IMhIMIIIIIMI IMIMIh MlhhM lllllhll MMIIMhl 

Db 230 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 289 

Qy 241 FTP YH IMRNLR I ASRLDS WPQ - GCTQKAI KS I YTLTRPLAFLNS AINP I FYFLMGDHYRE 299- 

llllhllhlMIM II I III I M MM 1 1 III I llhlllhllhh 

Db 2 30 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 349 

Qy 3 00 MLISKFRQYFKSLTSF 315 

I MM Ml MM 

Db 3 50 MLMNQLRHNFKS LTS F 3 65 



RESULT 3 •• 
ABG72131 

ID ABG72131 standard; Protein; 334 AA. .; 

XX 

AC ABG72131; 
XX 

DT 30-JAN-2003 (first entry) 

XX 

DE Human adenosine receptor. 
XX 

KW Human; mammalian; adenosine receptor; G-protein coupled receptor; 

KW GPCR; adenosine-mediated medical condition; vasodilation; hypotension; 

KW reversal of tachycardia; chronic renal disease; thyroid disorder; 

KW inflammation; asthma; hypertensive; antiarrhythmic; antiinflammatory; 

KW receptor. 

XX 

OS Homo sapiens. 

XX 

PN US2002137887-A1 . 
XX 

PD 2-6-SEP-2002 . 
XX 

PF 17-JAN-2001; 2 0 0 1US - 0765034 . 

XX 

PR 17-JAM-2001; 2001US - 0765034 . 
XX 

PA (HEDR/) HEDRICK J A. 

PA (LACH/) LACHOWICZ J E. 



PA (WANG/) WANG W. 

PA (GUST/) GUSTAFSON E L. 

XX 

PI Hedrick JA, Lachowicz JE, Wang W, Gustafson EL; 
XX 

DR WPI; 2003-074992/07. 

DR N-PSDB; ABS57291. 
XX 

PT Novel isolated mammalian adenosine receptor polypeptide useful for 

PT identifying an agonist or antagonist of the receptor for treating 

PT vasodilation, hypotension, chronic renal diseases, thyroid disorders 

PT and inflammation - 
XX 

PS Claim 2; Page 1.6-17; 19pp; English. 

XX 

CC The present invention relates to the isolation of a mammalian 

CC (human) adenosine receptor, and the polynucleotide sequence 

CC encoding it. The cloned receptor resembles a member of the 

CC G-protein coupled receptor (GPCR) superfamily that contains ; 

CC 7- transmembrane domains. The adenosine receptor is useful for ->.;' 

CC identifying agonists and antagonists of the receptor, which may be ^ 

CC useful for treating an adenosine-mediated medical condition. The 

CC adenosine receptor polypeptide sequence is also useful as an < 

CC antigen to elicit antibody production in an immunologically 

CC competent host. An antibody which binds specifically to the [ . ■ 

CC adenosine receptor is useful for treating medical conditions caused. . \\p 

CC or mediated by adenosine such as vasodilation; hypotension, reversal Mfj- 

CC of tachycardia, chronic renal diseases, thyroid disorders and: . - -ft 

CC inflammation (e.g. asthma) . The antibody can also be used to purify. .. - 

CC the adenosine receptor, or as a basis for immunoassays of the .receptor.- 

CC The polynucleotide sequence encoding the adenosine receptor is 'useful .^J 

CO for producing vectors and host cells containing the vectors. It is ; 

CC also useful for measuring expression of a mammalian adenosine jp£ 

CC receptor gene in a biological sample. The present sequence represents % 

CC human adenosine receptor. . ^ 

XX 

3Q Sequence 3 34 AA; - = 

Query Match 74.6%; Score 1231.6; DB 24; Length 334; x '- v 

Best Local Similarity 71.8%; Pred. No. 1.9e-115; < . 

Matches 227; Conservative 42; Mismatches 46; Indels 1; Gaps 1; 

Qy 1 MAQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I : hl'M | M I Mill M ilh hill I hi hi =| Mill hi Ml 
Db 5 MAWNATCKNWLAAEAALEKYYLSIFYGIEFWGVLGNTIWYGYIFSLKNWNS'SNIYLFN 64 

Qy 61 LSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLM 12 0 

I h I I II II I I h I h I I II M I II I I I I II I I I I I M I I II I I M h II I h : 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISTDRYLII 124 

9y 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

MM II I 1 1 ! LI I i 1 1 1 1 II : I llllhlhl Ml : h I hill hi M 

Db 125 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 . 

Ov 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

I I I I : I I I I I I I II I I I ! I I I I ! I : : I I I = I • = I llillhll ||::|l||||: 

Db IBS LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWTF3VP .244 



Qy 

Db 

Qy 

Db 



241 FTP YHIMRNLRI ASRLDS WPQ - GCTQKAI KS I YTLTRPLAFLNS AINP I FYFLMGDHYRE 2 99 

I I I M : I I I : I I I I I I II I I I I I I I =1 I II I MM I I hi II hi Ihh 
245 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPLAFLNSVINPVFYFLLGDHFRD 304 



3 00 MLISKFRQYFKSLTSF 315 

I!::: I IIIIMI 
3 05 MLMNQLRHMFKSLTSF 32 0 



RESULT 9 
AAW19854 
ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 



FT 
FT 

FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 

FT 
FT 

FT 
FT 

FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



AAW19854 standard; Protein; 334 AA. 
AAW19854; 

ll-SEP-1997 (first entry) 
Human purinergic receptor P2U2 . 

P2U2 receptor; purinergic receptor; diagnosis; therapy. 
Homo sapiens . 



Key 

Domain 
::o ; :via i n 

Domain. 

Domain 

Domain 

Domain 



Domain 



Domain, 



Domain 



Domain 



Lccation/Qualif iers 
■ 1 . .23 

/label= N-terminal_domain 
24 . .49 

/label= TMI ■ 

/note= "transmembrane domain I " 

ISO . .60 

/label= ICDI 

/note= "intracellular domain I" 
61. .82 

/label= TMI I 

/note= "transmembrane domain II" 
83 . . 99 

/label= ECDI 

/note= "extracellular domain. I" 
100. .119 
/label= TMI II 

/note= "transmembrane domain III" 
120 . . 141 
/label= ICDII 

/note= "intracellular domain II" 
142 . . 161 
/label= TMDIV 

/note= "transmembrane domain IV" 
162... . 183 
/labels ECDII 

/note= "extracellular domain II" 
184. .207 
/label= TMDV 

/note= "transmembrane domain V" 
208 . .233 
/label= ICDIII 

/note= "intracellular domain III" 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



Domain 



234 . .256 
/labels TMDVI 

/note= "transmembrane domain VI" 
257 . .276 
/label= ECDIII 

/note= "extracellular domain III" 
277 . .300 
/label= TMDVI I 

/note= "transmembrane domain VII" 
301. .334 

/label= C-terminal_domain 



Domain 



Domain 



Domain 



XX 

PM WO9720045-A2 . 
XX 

PD 05-JUN-1997. 

XX 

PF 08-NOV-1996; 96WO-US18175 . 
XX 

PR 15-NOV-1995; 95US-0559524 . 

PR 15-NOV-1995; 95US - 00 06732 . 
XX 

PA (CORT-) COR THERAPEUTICS INC. 

XX 

PI Conley PB, Jantzen II; 
XX 

PR WPI; 1997-310601/2 3. 

DR N-PSDB; AAT71900. 
XX 

PT New isolated purinergic receptor sub -type - used to develop - • . 

PT products for diagnosis and therapy , . e . g . - for screening for agonists. 

PT and antagonists which can modulate activation 

XX 

PS Claim 1; Fig 1A-B; 3 6pp; English. 
XX 

CC P2U2 receptor (AAW19854) is a novel human purinergic receptor 

CC subtype that is abundantly expressed in kidney and in many cell 

CC lines of megakaryocytic or erythroleukaemic origin and which is 

CC activated by ATP, UDP, UTP and UDP. Its amino acid sequence was 

CC deduced from a cDNA clone derived from DAM I (ATCC CRL 9792) cells. 

CC P2U2 and its polypeptides can be expressed in host cells and used 

CC to develop diagnostic and therapeutic agents. Antagonists and 

CC agonists based on the extracellular domains of P2U2 receptor, or 

CC which affect receptor function by binding to one of the 

CC intracellular domains, can be used to treat diseases caused by 

CC aberrant activation of this receptor or to treat diseases whose 

CC symptoms can be ameliorated by stimulating or inhibiting the 

CC activity of the receptor. 
XX 

SQ Sequence 334 AA; 

Query Match 74.3%; Score 1226.5; DB 18; Length 334 ; 
Best Local Similarity 71.5%; Pred. No. 6.2e-115; 

Matches 226; Conservative 42; Mismatches 47; Indels 1; Gaps 

Qy 1 MAQNLSCENWLATEAILMKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 



Do 




Qy 61 LS I SDFAFLCTLP I L I KS YANDKGTYGDVLC I 5NRYVLHTNL YTS I LFLTF I SMDR YLLM 12 0 

Ihll lillllhlhllll IIIMMIIIIIII llllllllllllhlllh: 

Db 65 LS VSDLAFLCTLPML IRS YANGNWI YGDVLC I SNRYVLHANL YTS I LFLTF I S IDRYL I I 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 18 0 

I MINI II M II Mill MM llllhlhl III : h I hi 1 1 hi = 1 

Db 12 5 KYPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qy 181 LIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSIL 24 0 

MlhllMIIIIIII lllllllh :|lh|::| MlllhM ||::|||llhl 
Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVL 244 

Qy 241 FTPYHIMRNLRI ASRLDSWPQ - GCTQKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYRE 299 

I I ! I I • I I I H I I I I I II I III I I I HI I I I II I I I - I I I I • I | | : | : 
Db 24 5 FTPYHVMRNA/RIASRLGSWKQYQCTQWINSFYIVTRALGFLNSVINPVFYFLLGDHFRD 3 04 

Qy 300 ML I S KFRQYFKSLTS F 315 

lh- I MM: 

Db 3 05 MLMNQLRHNFKSLTSF 32 0 



RESULT 10 
AAW22732 

ID AA.W22732 standard; Protein; 3 34 AA. 
XX 

AC ?\AN2273 2; 
XX 

■ DT 07-OCT-1997 (first entry) 
XX- , ; 

DE H^iman ATP receptor. 
XX 

KW ATP receptor; G-protein coupled receptor; agonist; antagonist: 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualif iers 

FT Misc-dif f erence 212 

FT /note= "encoded by TCC" 

FT Misc-diff erence 235 

FT /note= "encoded by TCG" 

FT Misc-diff erence 244 

FT /labels Unknown 

FT /note= "encoded by CYT" 

XX 

PN W09/24929-A1 . 
XX 

PD 17-JUL-1997. 
XX 

PF 11 -JAN- 7.996; 96WO-US00392 . 
XX 

PR 11- JAN- 1996; 96WO-US00392 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 

XX 

PI Li Y; 
XX 



DR WPI; 1997-372505/34. 

DR N-PSDB; AAT75146. 
XX 

PT Isolated human ATP receptor - agonists and antagonists of which are 

PT useful in treatment of, e.g. asthma, hypertension, arterial 

PT thrombosis and psychotic and neurological disorders 
XX 

PS Claim 15; Fig 1A-C; 53pp; English. 
XX 

CC Human ATP receptor (AAW22732) is structurally related to the G 

CC protein-coupled receptor family. It shows 29.8% identity to a 

CC murine P2u receptor. Its amino acid sequence was deduced from a 

CC human placental cDNA clone (AAT7514 6) . Recombinant ATP receptor can. 

CC ' be expressed in bacterial (e.g. E. coli) , mammalian (e.g. COS) or 

CC insect (e.g. Sf9) host cells and used to screen for agonists and 

CC antagonists useful in the treatment of conditions related to 

CC underexpression of the receptor (e.g. asthma, Parkinson's disease, 

CC acute heart failure, hypotension, urinary retention and 

CC osteoporosis) or underexpression of the receptor (e.g. arterial 

CC thrombosis, hypertension, thrombolysis, angioplasty, cystic 

CC fibrosis, ulcers, asthma, allergy, benign prostatic hypertrophy, 

CC psychotic and neurological disorders, dyskinesias, endogenous 

CC anorexia and bulimia) . 

XX 

SQ Sequence 334 AA; - 

Query Match 73.9%; Score 1219.5.; DB 18; Length 334; 

Best Local Similarity 71.2%; Pred . No. 3.1e-114; 

Matchas 225; Conservative * 43; Mismatches 47; Indels 1; Gaps,- i 

Qy 1 MAQNLS CENWLATEAI LNKY YL S AF Y AI E F I FGLLGNVT WFGYLFCMKNWNS SNV YLFN 60 

II I : h ! I I I II I M I I I II 111= Mill I hi hi MIMIIhlMI 

Db 5 M AWNATC KNWL AAE AALE KY YL S I F YG I E F WGVLGNT I WYG Y I F S L KNWNS SN I YL FN 6 4 

Oy 61 LS I SDFAFLCTLP IL I KS YANDKGT YGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLM 120 

Ihll llllllhlhllll IlilllllllMII llllllllllllhlllh: 

Db 65 LSVSDLAFLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLII 124 

Qy 121 KYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHN 180 

Ml MM Mill 1111111 = 1 llllhlhl III = h I hi 1 1 hi =1 

Db 125 KYPFREHLLQKKECAILISLAMWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYN 184 

Qv 181 LIYSLCLTLLGFLIPLSWCFFYYKMVVFLKRRSQQQATALPLDKPQRLWLAWIFSIL 240 

III hill II MM I I IIMMIh Ml hhM lllllhll I hM Mill: 

Db 185 LIYSMCLTLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVX 244 

Qv 241 FTPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE 2 99 

MllhllhMIIM II I III I I I Mlhlllll MhlMhllhh 

Db 24 5 FTPYHVMRNVRIASRLGSWKQYQCTQWINSFYIVTRPVAFLNSVINPVFYFLVGDHFRD 3 04 

Qy 3 00 MLISKFRQYFKSLTSF 315 

I |::= I INI II I 
Db 3 05 MLMNQLRHNFKSLTSF 32 0 



RESULT 11 
AAU3102 9 



ID AAU31029 standard; Protein; 387 AA. 
XX 

AC AAU3102 9; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #1520. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis ; nerve tissue regeneration ; 

KW immune suppression; immune stimulation; anti -inflammatory ; leukaemia. 

XX 

OS Homo sapiens . 
XX 

PN WO200179449-A2 . 

XX 

PD 25-OCT-2001. 
XX 

?F 16-APR-2001; 2 0 0 1WO-US0 8 65 6 . 
XX 

PR 18-APR-2000; 2 000US- 0552 92 9 . 

PR 26-JAN-2001; 2 00 1US- 0770160 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI ' Tai;g YT, Liu C, Drmanac RT; - 
XX " ' ■ 

DR WPI; 2 001-61172 5/7 0 . 
XX 

FT Nucleic ■ acids encoding a range of human polypeptides, useful in gerietic 

FT vaccination, t e s t i ng and t he r apy - 

XX . • 

PS Claim 20; Page 3 92; 7 65pp; English. 

XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 

CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising 

CC the nucleic acids encoding the polypeptides and cells genetically 

CC engineered to express them are also useful for producing the proteins. 

CC The proteins are useful in genetic vaccination, testing and 

CC therapy, and can be used as nutritional supplements. They may be used to 

CC increase stem cell proliferation; to regulate haematopoiesis; and in 

CC bone, cartilage, tendon and/or nerve tissue growth or regeneration; 

CC immune suppression and/ or stimulation; as anti- inflammatory agents; and 

CO in treatment of leukaemias. AAU29510-AAU33304 represent the amino acid 

CC sequences of novel human secreted proteins of the invention. 

XX 

SO Sequence 387 AA; 



Query Match 73.7%; Score 1216.5; DB 22; Length 3 87; 

Best Local Similarity 71.1%; Pred. No. 7.6e-114; 

Matches 224; Conservative 43; Mismatches 47; Indels 1; Gaps 



1 



Qy 2 AQNLSCENWLATEAILNKYYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNL 61 

I I =| = = IM III Mill M llh hill I hi hi :||||||hlllll 
Db 59 AWNATCKHWL AAEAALE KY YL S I F YG I E F WG VLGNT I WYG Y I F S LKNWNS SN I YLFNL 118 

Qy 62 SISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMK 121 

hll llllllhlhllll MIMIMIIIMI III III Mill IhlllhM 

Db 119 S VSDLAFLCTLPML IRS YANGNW I YGDVLC I SNRYVLHANL YTS I LFLTF I S IDRYLI I K 178 

Qy 122 YPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNL 181 

IIMII IMIIIIIIIIIh! Mllhlhl II I : h I hMlhl Ml 

Db 17 9 YPFREHLLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNL 23 8 

Qy 182 IYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAWIFSILF 241 

Ilhlllllll III III lllh Ml hhM IMIIhM I hM I II I hll 
Db 23 9 IYSMCLTLLGFSIPLFVMCLFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVLF 2 98 

Qy 242 TPYHIMRNLRIASRLDSWPQ-GCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREM 300 

lllhll hi III II II I III I I I M 1 1 1 M 1 1 1 1 1 1 M I I h 1 1 h h I 

Db 2 99 TPYHVMRNVRIASRLGSWKQYQCTQWIMSFYIVTRPLAFLNSVINPVFYFLLGDHFRDM 3 58 

Oy 301 LISKFRQYFKSLTSF 315 

h- I MMIII 

Db 3 59 LMNQLRHNFKSLTSF 373 



P'SSULT 12 
AYB45376 

TO AAB45376 standard; Protein; 2 5 S AA , 

XX ' 

AC AAB45376; . ' 

XX 

DT 14-FEB-2001 (first entry) .. ■ : . 

XX 

DE Hufiian secreted protein sequence encoded by gene 37 SEQ ID NO: 128. 

::x 

KW Human; secreted protein; diagnosis; immunosuppressive; antiarthritic ; 

KW antirheumatic; antiproliferative; cytostatic; cardiant; vasotropic; 

KW cerebroprotective ; nootropic; neuroprotective; antibacterial; virucide; 

KW fungicide; ophthalmological ; vulnerary; gene therapy; autoimmune disease; 

KW hyperprolif erative disorder; cardiovascular disorder; angiogenesis ; 

KW cerebrovascular disorder; nervous system disorder; infection; skin aging; 

KW ocular disorder; wound healing; food additive; preservative. 

XX 

03 Homo sapiens. 
XX 

PN WO200061628-A1 . 
XX 

?D 19-OCT-2000. 
XX 

PF 06-APR-2000; 2000WO-US09070 . 
XX 

PR 03-APR-19S9; 99US -012 8695 . 

PR 14-JAN-2000; 2000US -0176052 . 
XX 

PA (HTJMA-) HUMAN GENOME SCI INC. 



PI Rosen CA, Ruben. SM, Komatsoulis G; 
XX 

DR WPI; .2000-619228/59. 
XX 

PT New nucleic acid molecules encoding 49 human secreted proteins for 

PT diagnosing, preventing, treating or ameliorating medical conditions and 

PT used as food additives or preservatives - 

XX 

PS Disclosure; Page 447-448; 454pp; English. 
XX 

CC The polynucleotide sequences given in AAC81086 to AAC81134 encode the 

CC human secreted proteins given in AAB45308 to AAB45356. AAB45357 to 

CC AAB45384 represent human secreted polypeptide sequences and proteins 

CC homologous to them, which are given in the exemplification of the present 

CC invention. Human secreted proteins have activities based on the tissues 

CC and cells the genes are expressed in. Examples of activities include: 

CC antiarthritic; immunosuppressive; antirheumatic; antiproliferative;- 

CC cytostatic; cardiant; vasotropic; cerebroprotective ; nootropic; 

CC neuroprotective; antibacterial; virucide; fungicide; ophthaimological ; 

CC and. vulnerary. The polynucleotides and polypeptides can be used to 

CC prevent, treat or ameliorate a medical condition in e.g. humans, mice, 

CC rabbits, goats, horses, cats, dogs, chickens or sheep. They are also used 

CC in diagnosing a pathological condition or susceptibility to a - 

CC pathological condition. Disorders which are diagnosed or treated include 

CC autoimmune diseases, hyperprolif erative disorders, cardiovascular 

CC disorders, cerebrovascular disorders; angiogenesis , nervous system. 

CC disorders, infections caused by bacteria, viruses and fungi and ocular 

CC disorders. The polypeptides can also be used to aid wound healing and 

CC epithelial cell proliferation/ to prevent skin aging due to sunburn., to . 

CC maintain organs before transplantation, for supporting cell culture of 

CC primary tissues, to regenerate tissues and in chemotaxis. The :. 

CC polypeptides can also fc>e used as a food additive or preservative co 

CC increase or decrease storage capabilities, fat content, lipid,, protein, 

CC carbohydrate, vitamins, minerals, cof actors and other nutritional 

CC components .■ AAC81077 to AAC81085 and AAB45307 represent sequences used . in 

CC the exemplification of the present invention. 

XX 

SQ Sequence 2 58 AA; 

Query Match 61.5%; Score 1014.5; DB 21; Length 258; 

Best Local Similarity 72.9%; Pred. -No. 9.7e-94; 

Matches 188; Conservative 33; iMismatches 36"; Indels 1; Gaps 1; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFA-FLCTLPILIKSY 79,- 

III I 11 llh hill I hi hi MMIMhlMMhM I I II II hi hM 
D'ib 1 YYLS1FYGIEFWGVLGNTIWYGYIFSLKNWNSSNIYLFNLSVSDLAFLCTLPMLIP.SY 60 

Qy 30 ANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139' 

II II I II Mill II I I MMMMMMhlllhMIMlM II I Mi I I M I 

Db 61 ANGNWI YGDVLC I SNRYVLHANLYTS I LFLTF IS IDRYLI I KYPFREHLLQKKE FAIL I S 120 

Qy 14 0 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGMPEHNLIYSLCLTLLGFLIPLSVM 199 

I hi Mllhlhl II I : h I hlllhl MIMhMMMMIM II 

Db 121 LAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNLIYSMCLTLLGFLIPLFVM 180 

Qy 200 CFFYYKMVITLKRRSQQQATALPLDKPQRLVVLAVVIFSILFTPYHIMRNLRIASRLDSW 259 

II M I h : ! I h h : ! II I M h II I h M I II I h I II I! h I I h M M II !l 



Db 181 CFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAWIFSVLFTPYHVMRNVRIASRLGSW 24 0 

Qy 260 PQ - GCTQKAI KS I YTLTR 276 

I III I I I HI 
Db 241 KQYQCTQWINSFYIVTR 25 8 

RESULT 13 
ABP54317 

ID ABP54317 standard; Protein; 373 AA. 

XX 

AC ABP54317; 
XX 

DT 16-JAN-2-003 (first entry) 
XX 

DE Rat P2Y purinoceptor 1 protein SEQ ID NO : 7 . 
XX 

KW Human; G protein coupled receptor; GPCR; HGPRBMY2 7 ; ant i inflammatory ; 

KW antiinf ertility ; pulmonary; cytostatic; nephro tropic ; hormonal; 

KW circulatory; gene therapy; inflammatory disorder; reproductive disorder; 

KW pulmonary disorder; cancer; renal disorder; connective tissue disorder; 

KW endocrine disorder. ' 

XX 

CS Rattus norvegicus . 

XX - 

PN WO2G027?755-A2 . 

XX 

PD 13-SEP-200? . ■ ' 

XX - 

•;-F 06-MAR-.2002; 2 002WO-US06796 ■. ' 

XX 

PR 07-MAR-2001; 2 00 1US -273 8 08P . , ' 

:-R 2 7-MAR-2 001; 2C01US-278 983P . ' 
XX * 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Ramanathan C, Feder J, Mintier G, Cacace A, Barber L; 
XX 

DR WPI; 2002-657945/70. 
XX 

PT" New polynucleotide encoding a human G-protein coupled receptor for 

PT preventing, treating, or ameliorating e.g. an inflammatory, 

PT reproductive, pulmonary, renal connective tissue, or endocrine disorder 

XX 

I :, S Disclosure; Fig 2A-B; 356pp; English. 
XX 

CC The present invention describes a human G protein coupled receptor 

CC (GPCR) , designated HGPRBMY2 7 (I). (I) has antiinflammatory, 

CC antiinf ertility , pulmonary, cytostatic, nephrotropic , hormonal and 

CC circulatory activities, and can be used in gene therapy. (I) or the 

CC protein encoded by it can be used to prevent, treat, or ameliorate a 

CC medical condition, such as inflammatory disorders, reproductive 

CC disorders, pulmonary disorders, cancer, renal disorders, connective 

CC tissue disorders, endocrine disorders, or disorders involving 

CC aberrations in tubular tissues . They can also be used to diagnose a 

CC pathological condition or a susceptibility to (I) . The protein can 



CC be used to screen for candidate compounds capable of modulating activity 

CC of a GPCR polypeptide. The present sequence represents a GPCR given in 

CC comparison with the HGPRBMY27 protein in the exemplification of the 

CC present invention. 
XX 

SQ Sequence 3 73 AA; 

Query Match 29.0%; Score 479; DB 23; Length 373; 

Best Local Similarity 37.3%; Pred. No. 1.3e-39; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps 6 

QY 2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 7 9 

III I I : II I Ml =: ::| I I h = ll = ll|: = :|| =: II! II I 
Db 52 Y YLPAVYI LVF I IGFLGNS VAI WMFVFHMKPWSGI S VYMFNLALADFL YVLTLFAL I F YY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTS1LFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I =111 = 1 l = = I Ml ill III II II = II = 11=11 = 

Db 112 FNKTDWIFGDV1VICKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSLGRLKKKNAIYV 171 

Qy 13 9 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIFLS 197 

1= II =1 = : | = l = : : | : | | | |||:| |= | ||| 

Db 172 SVDVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 231 

Qy "- : :9-8 "VMCFFYYKMVVFLKRRSQQQATALPL-DKPQRLVVLAVVIFSILFTPYHIMRNLRiASRL 2 56 

:: | =| | : = || | ||:: : :|:= : | = |:|:.= : ,|[ 

Db 232 L ILGCYGLI VFA.LIYKDLDNS — - PLRRKS IYLVT IVLTVFAVSYIPFHVMKTMNLRARL 288 



D - 3WPQGCT - QKAI K3 I YTLTRPLAFLNS AINPI FYFLMGDHYREML 3 01 

I !.= I = : I : I I II III = = 11111 M ■ I I 

2 89 DFQTPEMCDFNDRVYATYQVTRGTjASLNSCVDPILYFLAGDTFRRRL 3 35 



RESULT 14 
ABU119 03 

■ i - 

J.D ABU11903 standard; Protein; 373 AA. 
XX 

AC ABU11903; 
XX 

DT 13-FEB-2003 (first entry) 
XX 

DE Rat purinergic receptor. 
XX 

KW Receptor; HGPREMY11; HGPRBMYllvl ; HGPRBMYllv2; GPCR74; GPCR81; 

KW G^protein coupled receptor; cardiovascular disease; arrhythmia ; 

KW myocardial infarction; congestive heart failure; cardiomyopathy; 

KW atherosclerosis; arteriosclerosis; embolism; angina; thrombosis; 

KW hypertension; Alzheimer's disease; Parkinson's disease; osteoporosis ; 

KW obesity; human immunodeficiency virus infection; HIV; schizophrenia; 

KW sleeplessness; acquired immunodeficiency syndrome; AIDS; leukaemia; 

KW sepsis; inflammation; psoriasis; Gaucher 's disease; ischaemia_. 
XX 

OS Rattus uorvegicus . 

XX 

PN WO200286123-A2 . 
XX » 

?D 3 1- OCT- 2 0 02 . 
XX 



PF 16-NOV-2001; 2 0 0 1W0-US4 4 0 1 9 . 
XX 

PR 17-NOV-2000; 2 000US-249613P . 

PR 21-DEC-2000; 2 0 0 OUS - 2 5 7 61 IP . 

PR 16-JUL-2001; 2 001US-305818P . 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Feder J, Nelson TC, Ramanathan C, Cacace AM, Barber LE; 
XX 

DR WPI; 2003-093137/08. 
XX 

PT New human G-protein coupled receptor HGPRBMY11 polypeptide or 

PT polynucleotide, useful for preventing, treating or ameliorating e.g. 

PT myocardial infarction, angina, thrombosis, Alzheimer's disease, 

PT schizophrenia, AIDS , leukemia - 

XX 

?S Disclosure; Fig 2; 444pp; English. : 
XX 

CC The invention relates to an isolated polypeptide (designated HGPRBMY11) , 

CC which has a G-protein coupled receptor (GPCR) activity (also known 

CC as GPCR74 or GPCR81) and is encoded by the cDNA contained in ATCC Deposit 

CC Number PTA-2766, its variants ( HGPRBMY1 lvl and HGPRBMYllv2) , fragments, 

CC domains, species homologues and proteins 95% similar to it. Also included 

CC are: (1) the nucleic acids encoding the HGPRBMY11 proteins (including 

00 variants, fragments, complements and sequences 95% similar to the 

CC HGPRBMY11 nucleic acids); (2) recombinant vectors; (3) transformed host 

CC cells expressing KGPRBMY11; (4 : ) an ant i - HGPRBMY 1 1 antibody; (5;) 

CC diagnosing a pathological condition or a susceptibility to a pathological 

CC condition in a subject; (6) identifying a binding partner to the 

CC HGPRBMY11 polypeptide; (7) identifying an activity in a biological ;assay,-" 

CO (8) a process for making polynucleotide sequences encoding a gene product 

CC having altered GPCR activity; (9) a shuffled polynucleotide produced by 

CC izhe method of (8) ; and (10) screening for candidate compounds capable of 

CC binding to and/or modulating activity of a GPCR. The polypeptide or the 

CC polynucleotide is useful for preventing, treating or ameliorating a 

CC medical condition, particularly cardiovascular diseases or disorders, 

CC e.g. myocardial infarction, congestive heart failure, arrhythmias, 

CC cardiomyopathy, atherosclerosis, arteriosclerosis, embolism, angina, 

CC thrombosis or hypertension). The HGPRBMY11 polypeptide or polynucleotide 

CC is also useful for treating, preventing or ameliorating e.g. Alzheimer's 

CC disease, Parkinson's disease, osteoporosis, obesity, human 

CC immunodeficiency virus (HIV) infections, schizophrenia, sleeplessness, 

CO acquired immunodeficiency syndrome (AIDS), leukaemia, sepsis, 

CC inflammations, psoriasis, Gaucher ' s disease or ischaemia (many 

CC other diseases and disorders are listed in the specification) . 

CO The present sequence represents a homologue of an HGPRBMY11 protein 

CC (or variant) . 

XX 

SO Sequence 3 73 AA; 

Query Match 2 9.0%; Score 4 79; DB 24; Length 3 73; 

Best Local Similarity 37.3%; Pred. No. 1.3e-39; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps .6; 



2 0 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 



Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 00 AND KG - T YGDVLC I SNRYVLHTNLYTS I LFLTF I SMDRYLLMKYPFREHFLQKKE FAI L I 138 

I =1-11 = 1 I I III III III II II = II : 11= II 

Db 112 FNKTDW I FGD VMCKLQRF I FHVNLYGS I LFLTC I S AHRYSG WYPLKSLGRLKKKNAI YV 171 

Qy 139 SLAVWALVTLEVLPMLTFINS -VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

1 = II =1 = = hi = = : I = III 111 = 1 1= I III 

Db 172 SVLVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

1=11= =111 ll== = =l = = = 1 = 1 = 1= = = =11 
Db 232 L I LGC YGL I VRAL I YKDLDNS PLRRKS 1 YLVI I VLTVFAVS YI PFHVMKTMNLRARL 288 

Oy 2 57 D - S WPQGCT - QKAI KS I YTLTRPLAFLMS AINP I F YFLMGDHYREML 301 

I 1=1 = = I =11 II III = = ll III II =1 I 

Db 2 89 DFQTPEMCDFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 33 5 

RESULT 15 

AAE04389 

ID AAE04389 standard; Protein; 373 AA. 
XX 

AC AAE043 89; 
XX 

DT 04-SEP-2001 (first entry) 
XX 

:0E Hurnau P2 -purinergic receptor subtype, P2Y1. L. 

AA ... • 

J.W Human; P2 -purinergic receptor; 1 P2Y1; cardiant; vasotropic; thrombolytic; 

XW cerebroprotective; gynaecological; ADP; adenosine 5 ' -diphosphate ; angina; 

. KW myocardial infarction; ischaemic attack; preeclampsia; bleeding disorder ; 

XW carotid endarterectomy ,- vascular graft surgery; brain disorder; migraine; 

KW vascular injury; schizophrenia; eating disorder; depression; angioplasty; 

KW peripheral vascular disease; platelet aggregation; restenotic; embolism; 

KW thrombocytopaenic purpura; stroke; pertussis toxin- sensitive G protein; 

KW Gi; disseminated intravascular coagulation; thrombosis. 
XX 

OS Homo sapiens . 

XX 

FH Key Location/Qualifiers 

FT Domain 53 . . 7 6 

FT /label= Transmembrane_domain_l 

FT Domain 84.. 109 

FT /label= Transmembrane_domain_2 

FT Domain 124.. 148 

FT /iabel= Trans membrane_domain_3 

FT Domain 167.. 190 

FT /label= Trans membrane_domain_4 

FT Domain 217 . .241 

FT /label= Transmembrane_domain_5 

FT Domain 258 . .283 

FT ,/label= Transmembrane_domain__6 

FT Domain 304 . . 32 9 

FT /labels Transmembrane_domain_7 

XX 

PN WO20C146454-A1 . 



r ■ i 



XX 

PD 28-JUN-2001. 

XX 

PF 26-DEC-2000; 2000WO-US34998 . 
XX 

PR 23-DEC-1999; 99US-0171622 . 
XX 

PA (GORT-) COR THERAPEUTICS INC. 
XX 

PI Conley PB, Jantzen H, Ramakrishnan-Dubridge V, Julius DJ/ 

PI Hollopeter G; 
XX 

DR WPI; 2001-418082/44. 
XX 

PT Novel isolated ADP receptor, termed P2Y12 receptor polypeptide, useful 

PT for identifying binding partners and for diagnostic applications 
XX 

PS Disclosure; Fig 5A; 108pp; English. 

XX 

CC The invention relates to ADP (adenosine 5 ' -diphosphate) receptor, termed 

CC as P2Y12 receptor and its corresponding cDNA molecule. P2Y12 receptor is 

CC the subtype of P2 -purinergic receptor. The P2Y12 receptor is expressed 

CC selectively in the platelets and brain, and couples to a pertussis toxin- 

CC sensitive G protein (Gi) . P2Y12 receptor is a G protein-coupled receptor 

CC that responds to ADP . The invention also relates to a method for 

CC identifying an agent which is useful for modulating acute myocardial >-% 

CC infarction, unstable angina, chronic stable angina, transient 'ischaemic . ■ ■ i 

CC attacks, strokes, peripheral . vascular disease, preeclampsia, deep venous , 'jf 

CC thrombosis, embolism, disseminated intravascular coagulation, '-Thrombotic |\ : '.' £ 

CC ^hrombocytopaenic purpura or a bleeding disorder; thrombotic and t\ : 

CC restenctic complications following angioplasty, carotid endarterectomy , m ; '• . 

CC post CABG (coronary artery bypass graft) surgery, vascular graft surgery, .: ^ 

CC stent placements or insertion of endovascular devices and prostheses.. 

CC P2Y12 receptor is useful for identifying binding partners and for . " ' ; V * • ^.\ r ; r 

CC diagnostic applications. P2Y12 receptor provides targets for screening .. ; 

CC synthetic small molecules and combinatorial or naturally occurring . 1 . 

CC compound libraries to regulate platelet aggregation, vascular injury, or 

CC disease as well as schizophrenia, eating disorders, depression, migraine 

CC , and other brain disorders. The present sequence is human P2 -purine rgic . 

CC receptor subtype, P2Y1 related to the invention. - ' 
XX 

V.Q Sequence 373 AA; 

Query Match 28.7%; Score 474; DB 22; Length 373; 
Best Local Similarity 37.3%; Pred . No. 4.2e-39; 

Matches 1.07; Conservative 55; Mismatches 117; Indels 8; Gaps : 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

MM I : II I III :: =:| || |: :||:|||:::|| == ||| ||. | 

Db 52 YYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qv SO ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFI3MDRYLLMKYPFREHFLQKKEFAILI 13 8 

I = 11 =| | = = I III MINI II II = II = 11= ! I I 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLRKKNAICI 171 . 

QY 139 SLAVWALVTLEVLPMLTFINS - VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 
h II =1 = = hi = =11= II I I I hi f: 1=11 



Db 172 SVLVWLIAT/VAISPILFYSGTGVRKMKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

QY 19 8 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 2 56 

:: | :| | : = || | || = = : :|:= : |:|:|: : : :|| 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKSIYLVIIVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAIMPIFYFLMGDHYREML 301 

II I = : I =11 II III : = || IN || :| I 

Db 289 DFQTPAMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 



Search completed: December 12, 2003, 18:07:09 
Job time : 42 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: December 12, 2003, 17:58:52 ; Search time 18 Seconds 

(without alignments) 
828.193 Million cell updates/sec 

Title: US-09-891-138A-2 
Perfect score: 1650 

Sequence: 1 MAQNLSCENWLATEAILNKY REMLISKFRQYFKSLTSFRT 317 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


479 


29. 


0 


373 


1 


P2YR_ 


_RAT 


P49651 


rattus norv 


2 


475 


28. 


8 


373 


1 


P2YR] 


"mouse 


P49650 


mus musculu 


3 


474 


28. 


7 


373 


1 


P2YR _ 


EOVin 


P48042 


bos taurus 


4 


474 


28. 


7 


373 


1 


P2YR~ 


_HUMAN 


P47900 


homo sapien 


5 


472.5 


28. 


6 


362 


1 


P2YR] 


_CHICK 


P34996 


gallus gall 


6 


472.5 


28. 


6 


362 


1 


P2YR~ 


MELGA 


P49652 


meleagris g 


7 


386.5 


23. 


4 


537 


1 


P2Y8_ 


_XENLA 


P79928 


xenopus lae 


8 


379.5 


23. 


0 


328 


1 


P2Y3] 


_CHICK 


Q98907 


gallus gall 


9 


374 


22. 


7 


374 


1 


P2Y2] 


_RAT 


P41232 


rattus norv 


10 


373.5 


22. 


6 


373 


1 


P2Y2_ 


MOUSE 


P35383 


mus musculu 


11 


371.5 


22. 


5 


328 


1 


P2Y3_ 


_MELGA 


093361 


meleagris g 


12 


363.5 


22. 


0 


377 


1 


P2Y2~ 


_HUMAN 


P41231 


homo sapien 


13 


354 


21. 


5 


361 


1 


P2Y4~ 


RAT 


035811 


rattus norv 


14 


353 


21. 


4 


355 


1 


C3X1~ 


HUMAN 


P49238 


homo sapien 


15 


352 


21. 


3 


365 


1 


P2Y4~ 


_HUMAN 


P51582 


homo sapien 


16 


343 


20. 


8 


361 


1 


P2Y4] 


_MOUSE 


Q9jjs7 


mus musculu 


17 


336.5 


20. 


4 


360 


1 


CKR4] 


"human 


P51679 


homo sapien 



18 


336 


20. 


4 


328 


1 


P2Y6_RAT 


Q63371 


rattus norv 


19 


333 


20. 


2 


308 


1 


P2Y5_CHICK 


P32250 


gallus gall 


20 


332.5 


20. 


2 


354 


1 


C3X1_RAT 


P35411 


rattus norv 


21 


332 


20. 


1 


32 8 


1 


P2Y6_HUMAN 


Q15077 


homo sapien 


22 


329 


19. 


9 


355 


1 


CKR1__MACMU 


P56482 


macaca mula 


23 


328 


19. 


9 


355 


1 


CKR1_HUMAN 


P32246 


homo sapien 


24 


326.5 


19. 


8 


346 


1 


CLT2_HUMAN 


Q9ns75 


homo sapien 


25 


326.5 


19. 


8 


360 


1 


CKR4_MOUSE 


P51680 


mus musculu 


26 


325.5 


19. 


7 


354 


1 


C3X1_M0USE 


Q9z0d9 


mus musculu 


27 


325. 5 


19. 


7 


388 


1 


SSR4 HUMAN 


P31391 


homo sapien 


28 


319. 5 


19. 


4 


345 


1 


CLT2 PIG 


Q95n03 


sus scrofa 


29 


319 


19. 


3 


353 


1 


AP J_XENLA 


P79960 


xenopus lae 


30 


318.5 


19. 


3 


359 


1 


IL8B__MOUSE 


P35343 


mus musculu 


31 


317.5 


19. 


2 


355 


1 


GPR1_HUMAN 


P46091 


homo sapien 


32 


317 


19. 


2 


368 


1 


SSR2_BOVIN 


P34993 


bos taurus 


33 


316.5 


19. 


2 


384 


1 


S5R4 RAT 


P30937 


rattus norv 


34 


315.5 


19. 


1 


355 


1 


GPR1_MACMU 


097664 


macaca mula 


35 


315.5 


19. 


1 


355 


1 


IL8A_RABIT 


P21109 


oryctolagus 


36 


315 


19. 


1 


355 


1 


CKR8_HUMAN 


P51685 


homo sapien 


37 


314 


19. 


0 


369 


1 


SSR2__HUMAN 


P30874 


homo sapien 


38 


314 


19. 


0 


369 


1 


SSR2_PIG 


P34994 


sus scrofa 


39 


314 


19. 


0 


370 


1 


P2Y9 HUMAN 


Q99677 


homo sapien 


40 


313.5 


19. 


0 


353 


1 


IL8B_PANTR 


Q28807 


pan troglod 


41 


313.5 


19. 


0 


356 


1 


CKR8_MACMU 


097665 


macaca mula 


42 


313.5 


19. 


0 


356 


1 


IL8B CANFA 


097571 


canis famil 


43 


313 


19. 


0 


344 


1 


P2Y5_HUMAN 


P43657 


homo sapien 


44 


312.5 


18. 


9 


360 


1 


IL8B_HUMAN 


P25025 


homo sapien 


45 


311. 5 


18. 


9 


309 


1 


CLT2 MOUSE 


Q920al 


mus musculu 



ALIGNMENTS 



RESULT 1 
P2YR_RAT 

ID P2YR_RAT STANDARD; PRT; 373 AA. 

AC P49651; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 01-FEB-1996 (Rel. 33, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) . 

GN P2RY1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Verteforata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Insulinoma; 

RX MEDLINE=95298025; PubMed=7779087 ; 

RA Tokuyama Y., Hara M. , Jones E.M.C., Fan Z., Bell G.I.; 

RT "Cloning of rat and mouse P2Y purinoceptors . " ; 

RL Biochem. Biophys . Res. Commun. 211:211-218(1995). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 
CC ATP AND ADP . IN PANCREATIC ISLETS, MAY MEDIATE SOME OF THE EFFECTS 

CC OF EXTRACELLULAR ATP ON INSULIN SECRETION. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 



CC -!- TISSUE SPECIFICITY: EXPRESSED IN MUSCLE, HEART, LIVER, KIDNEY, 
CC LUNG, BRAIN, SPLEEN, BUT NOT IN TESTIS . 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U22830; AAA91303.1; 

DR HSSP; P34996; 1DDD . 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 



DR 


PROSITE; 


PS00237; 


G_PROTEIN 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


GJPROTEIN 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor ; 


Transmembrane; Glycoprotein. 


FT 


DOMAIN 


1 


52 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


53 


74 


1 (POTENTIAL) . 


FT 


DOMAIN 


75 


87 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


88 


109 


2 (POTENTIAL). 


FT 


DOMAIN 


110 


126 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


127 


147 


3 (POTENTIAL) . 


FT 


DOMAIN 


148 


166 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


167 


188 


4 (POTENTIAL) . 


FT 


DOMAIN 


189 


218 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


219 


238 


5' (POTENTIAL) . 


FT 


DOMAIN 


239 


265 


CYTOPLASMIC (POTENTIAL) . 


FT . 


TRANSMEM 


266 


285 


6 (POTENTIAL) . 


FT 


DOMAIN 


286 


303 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


. 304 


328 


7 (POTENTIAL) . 


FT 


DOMAIN 


329 


373 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


124 


202 


BY SIMILARITY. 


FT 


CARBOHYD 


11 


11 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


27 


27 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


197 


197 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


373 AA; 


42321 MW; 6DDF67 62 87B5E64 8 CRC64; 



Query Match 29.0%; Score 479; DB 1; Length 373; 

Best Local Similarity 37.3%; Pred. No. 1.6e-25; 

Matches 107; Conservative 57; Mismatches 115; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

11111:111111 : : : : I I I I : : I I : I i I : : : I I : : I I I I I I 
Db 52 YYLPAVYI LVFI I GFLGNSVAIWMFVFHMKPWSGI SVYMFNLALADFLYVLTLPALI FYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I I : I I :: I I I I I I I I I I I I II : M : II : M : 

Db 112 FNKTDWIFGDVMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAIYV 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : 1 : I : : : I : II I II I : I I : I I I I 

Db 172 SVLVWLIVWAISPILFYSGTGIRKNKTVTCYDSTSDEYLRSYFIYSMCTTVAMFCIPLV 231 



Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

: : | : I I : .: II I I I : : : : I : : : I : I : I : : : : I I 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKSIYLVI IVLTVFAVSYIPFHVMKTMNLRARL 288 

Qy 257 D- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 301 

I I : I : : I : I I I I I I I : : I I I I I I I : I I 

Db 289 D FQT PEMC D FNDRVYAT YQVT RGLAS LN S CVD P I L YFLAGDT FRRRL 335 



RESULT 2 
P2YR_MOUSE 

ID P2YR_MOUSE STANDARD; PRT; 373 AA. 

AC P49650; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor). 

GN P2RY1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID-10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Insulinoma; 

RX MEDLINE=95298025; PubMed=777 9087 ; 

RA Tokuyama Y., Hara M. , Jones E.M.C., Fan Z., Bell G.I.; 

RT "Cloning of rat and mouse P2Y purinoceptors. "; 

RL Biochem. Biophys . Res. Commun. 211:211-218(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/Sv; 

RA Leon C. ; 

RT "Thromboresistance in P2Y1 receptor knockout mice."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/ DDB J databases. 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 

CC ATP AND ADP. IN PANCREATIC ISLETS, MAY MEDIATE SOME OF THE EFFECTS 

CC OF EXTRACELLULAR ATP ON INSULIN SECRETION. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U22829; AAA91302.1; -. 

DR EMBL; AJ245636; CAB57317.1; -. 

DR HSSP; P34996; 1DDD. 

DR MGD; MGI: 105049; P2ryl- 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 



DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN_ 


~RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor ; 


Transmembrane; Glycoprotein. 


FT 


DOMAIN 


1 


52 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


53 


74 


1 (POTENTIAL). 


FT 


DOMAIN 


75 


87 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


88 


109 


2 (POTENTIAL) . 


FT 


DOMAIN 


110 


126 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


127 


147 


3 (POTENTIAL). 


FT 


DOMAIN 


148 


166 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


167 


188 


4 (POTENTIAL) . 


FT 


DOMAIN 


189 


218 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


219 


238 


5 (POTENTIAL) . 


FT 


DOMAIN 


239 


265 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


266 


285 


6 (POTENTIAL) . 


FT 


DOMAIN 


286 


303 


EXTRACELLULAR (POTENTIAL) . 


r 1 




a 

Z>\J H. 


328 


7 (POTENTIAL) . 


FT 


DOMAIN 


329 


373 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


124 


202 


BY SIMILARITY. 


FT 


CARBOHYD 


11 


11 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


27 


27 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) ( POTENTIAL) 


FT 


CARBOHYD 


197 


197 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


373 AA; 


42212 MW; 944125E9F4560BB3 CRC64; 


Query Match 




28.8%; 


Score 475; DB 1; Length 373; 



Best Local Similarity 37.3%; Pred. No. 3e-25; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 7; 

Qy 20 YYLSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I S DFAFLCTLPI LI KS Y 79 

|||||:|||||| : : : : I I I I : : ] I : I I I : : : I I : : I I I I I I 
Db 52 YYLPAWILVFIIGFLGNSVAIWMFVFHMKPWSGISVTMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I : : I I I I I I I I I I I I II : I I : I I : I I : 

Db 112 FNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAIYV 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : I : II I : I I I : I I : I I I I 

Db 172 SVLVWLIWVAISPILFYSGTGTRKNKTVTCYDTTSNDYLRSYFIYSMCTTVAMFCIPL- 230 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

| : | : : | I II I | | : : : : | : : : |: I : I : : : : I I 

Db 231 VLILGCYGLIV — KAL I YN DLDNSPLRRKSIY L VI I VLT VFAVS YIP FHVMKTMN L RARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I I : I : : I : I I I I I I I : : M II I I I : I I 

Db 289 D FQT P EMC D FNDRVYAT YQVT RGLAS LN S CVD P I L Y FLAGDT FRRRL 335 



RESULT 3 
P2YR_BOVIN 

ID P2YR_BOVIN STANDARD; PRT; 373 AA. 

AC P48042; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 



DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) . 

GN P2RY1. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia ; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N. A. 

RC TISSUE=Aortic endothelium; 

RX MEDLINE=95 352 05 8 ; PubMed=7 62 607 9 ; 

RA Henderson D.J., Elliot D.G., Smith G.M., Webb T.E., Dainty I. A.; 

RT "Cloning and characterisation of a bovine P2Y receptor."; 

RL Biochem. Biophys . Res. Coitimun. 212:648-656(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Corpus callosum; 

RX MEDLINE=99 0 64 5 62 ; PubMed=9 8 4 8 09 6; 

RA Deng G. , Matute C, Kumar C.K., Fogarty D.J., Miledi R. ; 

RT "Cloning and expression of a P2y purinoceptor from the adult bovine 

RT corpus callosum."; 

RL Neurobiol. Dis. 5:259-270(1998). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 
CC ATP AND ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS . 

CC 

CC This SWISS- PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X87628; CAA60958.1; 

DR EMBL; U34041; AAC78275.1; -. 

DR PIR; JC4162; JC4162. 

DR HSSP; P34996; 1DDD. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 



DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN_ 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Transmembrane; Glycoprotein 


FT 


DOMAIN 


1 


52 


EXTRACELLULAR (POTENTIAL). 


FT 


TRANSMEM 


53 


74 


1 (POTENTIAL) . 


FT 


DOMAIN 


75 


87 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


88 


109 


2 (POTENTIAL) . 


FT 


DOMAIN 


110 


126 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


127 


147 


3 (POTENTIAL) . 


FT 


DOMAIN 


148 


166 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


167 


188 


4 (POTENTIAL) . 


FT 


DOMAIN 


189 


218 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


219 


238 


5 (POTENTIAL) . 


FT 


DOMAIN 


239 


265 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


266 


285 


6 (POTENTIAL) . 



FT 


DOMAIN 


286 


303 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


304 


328 


7 (POTENTIAL) . 




FT 


DOMAIN 


329 


373 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


124 


202 


BY SIMILARITY. 




FT 


CARBOHYD 


11 


11 


N-LINKED (GLCNAC. . 


. . ) (POTENTIAL) . 


FT 


CARBOHYD 


27 


27 


N-LINKED (GLCNAC. . 


. .) (POTENTIAL). 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . 


, .) (POTENTIAL). 


FT 


CARBOHYD 


197 


197 


N-LINKED (GLCNAC. . 


, . ) (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA; 


422* 


37 MW; 9270A7175C0BDA76 


CRC64; 



Query Match 28.7%; Score 474; DB 1; Length 373; 

Best Local Similarity 36.6%; Pred. No. 3.5e-25; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 



Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

|||||:|||||| : : : : | | | | : : I I : I I I : : : I I : : I I I I I I 
Db 52 YYLPAVYI LVFI I GFLGNSVAIWMFVFHMKPWSGI S VYMFNLAIADFL YVLTLPALI FYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I :: I I I I I I I I I I I I II : I I : I I : I : I 

Db 112 FNKTDWI FGDAMCKLQRFI FHVNLYGSI LFLTCI SAHRYSGWYPLKSLGRLKKKNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

| : I I : I : : I : I : : : I : II I I I I : I I : I : I I 

Db 172 S VLVWLI VWGI S P I LFYS GTGI RKNKT I TCYDTT S DEYLRS YFI YSMCTTVAMFCVP LV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVA^LAVVIFSILFTPYHIMRNLRIASRL 256 

: : | : | I : : II I I I : : : : I : : : I : I : I : : : _ : I I 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKS I YLVI I VLTVFAVS YI P FHVMKTMNLRARL 288 

Qy 257 D- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 301 



289 D FQT P EMCAFN DRVYAT YQVT RGLAS LN S CVD P I L Y FLAGDT FRRRL 335 



RESULT 4 


P2YR_ 


HUMAN 


ID 


~P2YR HUMAN STANDARD; PRT; 373 AA. 


AC 


P47900; 


DT 


01-FEB-1996 (Rel. 33, Created) 


DT 


01-FEB-1996 (Rel. 33, Last sequence update) 


DT 


28-FEB-2003 (Rel. 41, Last annotation update) 


DE 


P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) . 


GN 


P2RY1. 


OS 


Homo sapiens (Human) . 


OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


OC 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 


OX 


NCBI TaxID=9606; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RC 


TISSUE^Placenta; 


RX 


MEDLINE=96257237; PubMed=8 6662 90 ; 


RA 


Leon C, Vial C, Cazenave J. -P., Gachet C; 


RT 


"Cloning and sequencing of a human cDNA encoding endothelial P2Y1 


RT 


purinoceptor . " ; 


RL 


Gene 171:295-297 (1996) . 


RN 


[2] 



RP SEQUENCE FROM N.A. 

RX MEDLINE=96158962; PubMed=8579591 ; 

RA Ayyanathan K., Tania W., Harbansjit Raghbir A.S., Barnard E.A. , 

RA Kunapuli S.P.; 

RT "Cloning and chromosomal localization of the human P2Y1 

RT purinoceptor . " ; 

RL Biochem. Biophys . Res. Commun. 218:783-788(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96205320; PubMed-8 630005 ; 

RA Janssens R., Communi D., Pirotton S., Samson M. , Parmentier M. f 

RA Boeynaems J.M.; 

RT "Cloning and tissue distribution of the human P2Y1 receptor."; 

RL Biochem. Biophys. Res. Commun. 221:58 8-593(1996). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Blood; 

RA Leon C, Vial C, Weber J., Cazenave J. -P., Gacher C. ; 

RL Submitted (JUN-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE OF 95-373 FROM N.A. , FUNCTION, AND INHIBITION STUDIES. 

RC TISSUE=Platelet; 

RX MEDLINE=98113162; PubMed-944204 0 ; 

RA Jin J. , Daniel J.L., Kunapuli S.P.; 

RT "Molecular basis for ADP-induced platelet activation. II. The P2Y1 

RT receptor mediates ADP-induced intracellular calcium mobilization and 

RT shape change in platelets."; 

RL J. Biol. Chem. 273:2030-2034(1998). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 
CC ATP AND ADP . IN PLATELETS BINDING TO ADP LEADS TO MOBILIZATION OF 

CC INTRACELLULAR CALCIUM IONS VIA ACTIVATION OF PHOSPHOLIPASE C, A 

CC CHANGE IN PLATELET SHAPE, AND PROBABLY TO PLATELET AGGREGATION. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- INDUCTION: REPRESSED BY THE P2Y1 RECEPTOR- SPECIFIC ANTAGONISTS 
CC A3P5PS, A3P5P AND A2P5P. THESE INHIBIT CALCIUM ION MOBILIZATION 

CC AND SHAPE CHANGE IN PLATELETS. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC ■ 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



CC 

DR EMBL; Z49205; CAA89066.1; -. 

DR EMBL; U42030; AAA97873.1; -. 

DR EMBL; U42029; AAA97872.1; -. 

DR EMBL; S81950; AAB47091.1; -. 

DR EMBL; AJ006945; CAA07339.1; 

DR EMBL; AF018284; AAB94556.1; 

DR PIR; JC4737; JC4737. 

DR HSSP; P34996; 1DDD. 

DR Genew; HGNC:8539; P2RY1. 

DR MIM; 601167; -. 

DR GO; GO: 0005887; C:integral to plasma membrane; TAS . 



DR 


GO; GO:0004872; F: 


receptor . 


activity; TAS . 


DR 


GO; GO:0007200; P: 


G-protein 


signaling, coupled to IP3 second . . . ; 


DR 


InterPro; 


IPR000276; GPCR_Rhodpsn . 


DR 


Pfam; PF00001; 7tm_l; 1. 




DR 


PRINTS; PR00237; GPCRRHODOPSN . 


DR 


PROSITE; 


PS00237; 


G PROTEIN 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G__PROTEIN 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Transmembrane; Glycoprotein; Platelet; 


KW 


Blood coagulation. 






FT 


DOMAIN 


1 


52 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


53 


74 


1 ( POTENTIAL) . 


FT 


DOMAIN 


75 


87 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


88 


109 


2 (POTENTIAL) . 


FT 


DOMAIN 


110 


126 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


127 


147 


3 (POTENTIAL) . 


FT 


DOMAIN 


148 


166 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


167 


188 


4 (POTENTIAL) . 


FT 


DOMAIN 


189 


218 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


219 


238 


5 (POTENTIAL) . 


FT 


DOMAIN 


239 


265 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


266 


285 


6 (POTENTIAL) . 


FT 


DOMAIN 


286 


303 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


304 


328 


7 (POTENTIAL) . 


FT 


DOMAIN 


329 


373 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


124 


202 


BY SIMILARITY. 


FT 


CARBOHYD 


11 


11 


N-LINKED ( GLCNAC . . .) (POTENTIAL). 


FT 


CARBOHYD 


27 


27 


N-LINKED (GLCNAC- . .) (POTENTIAL). 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


197 


197 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CONFLICT 


138 


138 


MISSING (IN REF . 1) . 


SQ 


SEQUENCE 


373 AA; 


42071 MW; 4DC7C668B4 145392 CRC64; 



Query Match 28.7%; Score 474; DB 1; Length 373; 

Best Local Similarity 37.3%; Pred. No. 3.5e-25; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS Y 79 

|||||:|||||| : : : : | I I I : : I I : I I I : : : I I : : I I I I I I 
Db 52 YYLPAWILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I :: I I I I I I I I I I I I II : I I : M : I I I 

Db 112 FNKTDWIFGDAMCKLQRFIFHWLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAICI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : M : I : : I : I : : I I : II I I I I : I I : I : I I 

Db 172 SVLVWLIWVAISPILFYSGTGVRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 Vl^CFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAVVIFSILFTPYHIMRNLRIASRL 256 

: : | : | | : : II I I I : : : : I : : : I : I : I : : : : I I 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKSIY L VI I VLT VFAVS YIP FH VMKTMN L RARL 288 

Qy 257 D-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I I I : : I : I I I I M I : : I I I M I I : I I 

Db 289 DFQT PAMCAFNDRVYAT YQVTRGLAS LNS CVDP I L YFLAGDT FRRRL 335 



RESULT 5 
P2YR_CHICK 

ID P2YR_CHICK STANDARD; PRT; 362 AA. 

AC P34996; 

DT Ol-FEB-1994 (Rel. 28, Created) 

DT Ol-FEB-1994 (Rel. 28, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) . 

GN P2RY1 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE— 932 8 534 0; PubMed=8 5 08 924; 

RA Webb T.E., Simon J., Krishek B.J., Bateson A.N., Smart T.G., 

RA King B.F., Burnstock G., Barnard E.A. ; 

RT "Cloning and functional expression of a brain G-protein-coupled ATP 

RT receptor. " ; 

RL FEBS Lett. 324:219-225(19 93). 

RN [2] 

RP 3D- STRUCTURE MODELING. 

RX MEDLINE=97026278; PubMed=8872457 ; 

RA van Rhee A.M., Fischer B. , van Galen P.J.M., Jacobson K.A. ; 

RT "Modelling the P2Y purinoceptor using rhodopsin as template."; 

RL Drug Des. Discov. 13:133-140(1995). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 
CC ATP AND ADP . SEEMS TO MEDIATE ITS ACTION VIA A PERTUSSIS TOXIN 

CC INSENSITIVE G-PROTEIN, PROBABLY BELONGING TO THE GQ FAMILY THAT 

CC ACTIVATE A PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: BRAIN, SPINAL CORD, GASTROINTESTINAL TRACT, 
CC SPLEEN AND LEG MUSCLE. IS NOT DETECTED IN THE HEART, LIVER, 

CC STOMACH, LUNG AND KIDNEY. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X73268; CAA51716.1; 

DR PIR; S33733; S33733. 

DR PDB; 1DDD; ll-JUL-96. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_JL; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1__1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN__RECEP_Fl_2 ; 1. 

KW G-protein .coupled receptor; Transmembrane; Glycoprotein; 3D-structure . 

FT DOMAIN 1 41 EXTRACELLULAR (POTENTIAL). 



FT 


TRANSMEM 


42 


63 


1 (POTENTIAL) . 


FT 


DOMAIN 


64 


76 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


77 


98 


2 (POTENTIAL) . 


FT 


DOMAIN 


99 


115 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


116 


136 


3 (POTENTIAL) . 


FT 


DOMAIN 


137 


155 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


156 


177 


4 (POTENTIAL). 


FT 


DOMAIN 


178 


207 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


208 


227 


5 (POTENTIAL) . 


FT 


DOMAIN 


228 


254 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


255 


274 


6 (POTENTIAL) . 


FT 


DOMAIN 


275 


292 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


293 


317 


7 (POTENTIAL) . 


FT 


DOMAIN 


318 


362 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


113 


191 


BY SIMILARITY. 


FT 


CARBOHYD 


11 


11 


N-LINKED (GLCNAC. . . ) ( POTENTIAL) 


FT 


CARBOHYD 


26 


26 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


102 


102 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


186 


186 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


HELIX 


42 


69 




FT 


HELIX 


77 


102 




FT 


HELIX 


115 


137 




FT 


HELIX 


154 


178 




FT 


HELIX 


205 


231 




FT 


HELIX 


250 


275 




FT 


HELIX 


290 


305 




FT 


TURN 


306 


307 




FT 


HELIX 


308 


320 




SQ 


SEQUENCE 


362 AA; 


41194 


MW; A806C88FB9514761 CRC64; 



Query Match 28.6%; Score 472.5; DB 1; Length 362; 

Best Local Similarity 35.9%; Pred. No. 4.3e-25; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8; 

Qy 10 WLA TEAILNK Y YL S AF YAI E FI FGLLGN VT WFGYL FCMKNWN S S NVYL FN 60 

II I : II III I : I I I I I I : : : : I I : I : : I I : I I 

Db 22 WAAGNATTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGI SVYMFN 81 

Qy 61 LSISDFAFLCTLPILIKSYANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLL 119 

I : : : I I : : I I I I I II : I I I : I I : : I I I I I I I I I I I I : I I 

Db 82 IxAJJU)FLWLTLPAIiIFYYFNKTDWIFGDVMCKLQRFIFHVNLYGSILFLTCISVHRYTG 141 

Qy 120 MKYPFREHFLQKKEFAILISLAWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 178 

: : I : I I : I : : I I I I I I : I : I : : | : : II : 

Db 142 V^PLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGTGVRRNKTITCYDTTADEYLR 201 

Qy 179 HNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIF 237 

: II : I I : III I : I : I I : : II I I I : : : : I 
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKS I YLVI I VLTVF 258 

Qy 238 S I LFTP YHIMRNLRI ASRLD- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINP I FYFLMGD 295 

: : : I : I : I : I : : I I I III : : I : I I I I I I I : : I I I I I I I 

Db 259 AVSYLPFHVMKTLNLRARLDFQTPQMCAFNDKAATATYQVTRGLASLNSCVDPILYFLAGD 318 

Qy 296 HYREML 301 

: I I 

Db 319 TFRRRL 324 



RESULT 6 
P2YR_MELGA 

ID P2YRJ4ELGA STANDARD; PRT; 362 AA. 

AC P49652; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 15-JUL-1998 (Rel. 36, Last annotation update) 

DE P2Y purinoceptor 1 (ATP receptor) (P2Y1) (Purinergic receptor) (6H1 

DE orphan receptor) . 

GN P2RY1. 

OS Meleagris gallopavo (Common turkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Meleagrididae; Meleagris. 

OX NCBI_TaxID=9103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE= 94335907; PubMed= 8 058 061; 

RA Filtz T.M., Li Q., Boyer J.L., Nicholas R. A. , Harden T.K.; 

RT "Expression of a cloned P2Y purinergic receptor that couples to 

RT phospholipase C"; 

RL Mol. Pharmacol. 4 6:8-14(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97382456; PubMed-924 04 60 ; 

RA Li Q., Schachter J.B., Harden T.K., Nicholas R.A. ; 

RT "The 6H1 orphan receptor, claimed to be the p2y5 receptor, does not 

RT mediate nucleotide-promoted second messenger responses."; 

RL Biochem. Biophys . Res. Commun. 236:455-460(1997). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADENINE NUCLEOTIDES SUCH AS 
CC ATP AND ADP . SEEMS TO MEDIATE ITS ACTION VIA A PERTUSSIS TOXIN 

CC INSENSITIVE G-PROTEIN, PROBABLY BELONGING TO THE GQ FAMILY THAT 

'CC ACTIVATE A PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: MAINLY FOUND IN BLOOD, BRAIN, AND LUNG. TO A 

CC LESSER EXTENT IN STOMACH, GUT, AND SKELETAL MUSCLE. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U09842; AAA18784.1; -. 

DR EMBL; AF012103; AAB65428.1; 

DR HSSP; P34996; 1DDD. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS502 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 
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2 93 


ol / 


/ ( POl CjN I lAlt ; . 
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DOMAIN 
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362 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


113 


191 


BY SIMILARITY. 


FT 


CARBOHYD 


11 


11 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


26 


26 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 
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N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


186 


186 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


362 AA; 


41180 MW; 


3E128AB9EB64349C CRC64; 


Query Match 




28.6%; 


Score 472.5; DB 1; Length 362; 



Best Local Similarity 35.9%; Pred. No. 4.3e-25; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8; 

10 WLA TEAILNK YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFN 60 

II I : II III I : I I I I I I : : : : I I : I : : I I : I I 

22 WAAGNASTKCSLTKTGFQFYYLPTVYILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 

61 LSI SDFAFLCTLPI LI KS YANDKG— TYGDVLCI SNRYVLHTNL YTS I LFLTFI SMDRYLL 119 

I : : : M : : I I I I I II : I I I : I I : : I I M I I I I I I I I : I I 

82 LALADFLYVLTLPALI FYYFNKTDWI FGDVMCKLQRFI FHVNLYGS I LFLTCI SVHRYTG 141 

12 0 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 178 

: : I : I I : I : : I I I I I I : I : I : : I : : II : 

142 WHPLKSLGRLKKKNAVYVSSLVWALWAVIAPILFYSGTGVRRNKTITCYDTTADEYLR 201 

179 HNLIYSLCLTLLGFLIPLSVTV[CFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIF 237 

: I I : I I : III I : I : I I : : II I I I : : : : I 
202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKSIYLVIIVLTVF 258 

23 8 SILFTPYHIMRNLRIASRLD-SWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295 

: : : I : I : I : I : : I I I III :: I : I I I I I I I :: I I I I I I I 

259 AVSYLPFHvTyiKTLNLRARLDFQTPQMCAFNDKVYATYQVTRGLASLNSCVDPILYFLAGD 318 

296 HYREML 301 

: I I 
319 TFRRRL 324 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



RESULT 7 
P2Y8_XENLA 

ID P2Y8_XENLA STANDARD; PRT; 537 AA. 

AC P79928; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 



DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE P2Y purinoceptor 8 (P2Y8) . 

GN P2RY8. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI__TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Neural plate; 

RX MEDLINE-97284734; PubMed=91397 11 ; 

RA Bogdanov Y.D., Dale L., King B.F., Whittock N., Burnstock G. ; 

RT "Early expression of a novel nucleotide receptor in the neural plate 

RT of Xenopus embryos."; 

RL J. Biol. Chem. 272:12583-12590(1997). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ATP, UTP, CTP, GTP AND ITP. 
CC THE ACTIVITY OF THIS RECEPTOR IS MEDIATED BY G PROTEINS WHICH 

CC ACTIVATE A PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC MAY PLAY A KEY ROLE IN THE EARLY DEVELOPMENT OF NEURAL TISSUE. 

CC ■ -»- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



CC 










DR 


EMBL; X99953; CAA6 


8213.1; 




DR 


HSSP; P34996; 1DDD 






DR 


InterPro; 


IPR000276; GPCR_Rhodpsn . 


DR 


Pfam; PF00001; 7tm 


. 1; 1. 




DR 


PRINTS; PR00237; GPCRRHODOPSN . 


DR 


PROSITE; 


PS00237; 


G PROTEIN 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN~ 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Transmembrane; Glycoprotein. 
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7 (POTENTIAL) . 
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BY SIMILARITY. 
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CARBOHYD 


26 


26 


N-LINKED (GLCNAC. . .) ( POTENTIAL) 


FT 


CARBOHYD 


29 


29 


N-LINKED (GLCNAC. . .) (POTENTIAL) 



SQ SEQUENCE 537 AA; 



62024 MW; 



B2CF24812F3C19F2 CRC64 ; 



Query Match 23.4%; Score 386.5; DB 1; Length 537; 

Best Local Similarity 35.3%; Pred. No. 3.6e-19; 

Matches 104; Conservative 51; Mismatches 125; Indels 15; Gaps 7; 

Qy 22 LSAFYAI EFI FGLLGN VTVVFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS YAN 81 

I I : I : I I I : : : : : I : I I : II : I I I : : I I : : : I I I : II 

Db 46 LPVSYSAVFMVGLPLNI7W4WIFIAKMRPWNPTTVYMFNLALSDTLYVLSLPTLVYYYA- 104 

Qy 82 DKGT--YGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILIS 139 

II : I : I I I I : : : I I I : I I I I I I I I : I I : : I : I : I 

Db 105 DKNNWPFGEVLCKLVRFLFYANLYSSILFLTCISVHRYRGVCHPITSLRRMNAKHAYVIC 164 

Qy 14 0 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

II | | | : : | | | : I I : : I I : : I I : I I I I : : 

Db 165 ALVWLSVTLCLVPNLI FVTVS PKVKNTI CHDTTRPEDFARYVEYSTAIMCLLFGI PCLI I 224 

Qy 200 CFFYYKMWFLKRR — SQQQATALPLDKPQ — RL WLAWI FS I L FT P YH I MRN L RI AS R 255 

I | I : I I I II I : : :: |:| I |:|| I I :| 

Db 225 AGCYGLMTRELMKPIVSGNQQT-LPSYKKRSIKTIIFVMIAFAICFMPFHITRTLYYYAR 283 

Qy 256 LDSWPQGCTQKAIKSI YTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQ 307 

I I I : I I : II I I I llhlllll III I 1 I : 

Db 284 L LGIKCYALNVINWYKVTRPLASANSCIDPILYFLANDRYRRRLIRTVRR 334 



RESULT 8 
P2Y3_CHICK 

ID P2Y3_CHICK STANDARD; PRT; 328 AA. 

AC Q98907; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE P2Y purinoceptor 3 (P2Y3) (Nucleoside diphosphate receptor) . 

GN P2RY3. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=96319774; PubMed=8700132 ; 

RA Webb T.E., Henderson D. , King B.F., Wang S., Simon J., 

RA Bateson A.N., Burnstock G., Barnard E.A. ; 

RT "A novel G protein-coupled P2 purinoceptor (P2Y3) activated 

RT preferentially by nucleoside diphosphates."; 

RL Mol. Pharmacol. 50:2 58-265(1996). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR ADP > UTP > ATP = UDP. THE 
CC ACTIVITY OF THIS RECEPTOR IS MEDIATED BY G PROTEINS WHICH ACTIVATE 

CC A PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . . 



CC 
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HSSP; P34996; 1DDD 








DR 


InterPro; 


IPR000276; GPCR_ 


Rhodpsn . 


DR 


Pfam; PF00001; 7tm 


. 1; 1. 






DR 


PRINTS; PR00237; GPCRRHODOPSN . 


DR 


PROSITE; 


PS00237; 


G_PROTEIN 


_RECEP_F1__1 ; FALSE_NEG . 


DR 


PROSITE; 


PS50262; 


G PROTEIN 


_RECEP_F1_2 ; 1 . 
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G-protein 


coupled 


receptor 




Transmembrane; Glycoprotein 
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N-LINKED ( GLCNAC . . .) {VO r . 


FT 


DISULFID 


94 


172 




BY SIMILARITY . 


SQ 


SEQUENCE 


328 AA; 


37586 


MW; 7A3BF1C91F54FAAB CRC64; 



Query Match 23.0%; Score 379.5; DB 1; Length 328; 

Best Local Similarity 29.6%; Pred. No. 6.8e-19; 

Matches 88; Conservative 63; Mismatches 135; Indels 11; Gaps 4; 

Qy 22 LSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLSI SDFAFLCTLPI LIKS YA- 80 

I | : : | : | | | | : | : : I : I I : : : I : : I : I I : I I : I 

Db 24 LPLVYSWFLLGLPLNAWI GQIWLARKALTRTTIYMLNLAMADLLYVCSLPLLI YNYTQ 83 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139 

I : I I I I : : I I I : I I I I I I I I : I I : : : I I : I = I : 

Db 84 KDYWPFGDFTCKFVRFQFYTNLHGSILFLTCISVQRYMGICHPLASWHKKKGKKLTWLVC 14 3 

Qy 140 LAW7VLVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

I I I : I : | | • | :: : : I I : I : I I : I I I : I : : 

Db 144 AAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSTSYFPYGITLTITGFLLPFAAI 203 

Qy 200 CFFYYKMWFLKRRSQQQATAL — PLDKPQRLWLAWI FSI LFT PYHIMRNLRIASRLD 257 

I I I : : : I : II | : : : : | : : | | | | | : | : : : : I 

Db 204 LACYCSMARILCQKDELIGLAVHKKKDKAVRMIIIWIVFSISFFPFHLTKTIYLIVRSS 263 

Qy 258 SWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE MLISKFRQ 307 

: I : I I I I I I : I I : : I I : : : I I : I I : I I 

Db 2 64 ASLPCPTLQAFAIAYKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKMSSKWRQ 320 



RESULT 9 
P2Y2_RAT 

ID P2Y2_RAT STANDARD; PRT; 374 AA. 

AC P41232; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) ( P2U purinoceptor 1) (P2U1) (ATP receptor) 

DE (Purinergic receptor) . 

GN P2RY2 OR P2RU1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP ^ SEQUENCE FROM N.A. 

RX ' MEDLINE-95110548; PubMed=7 8 114 68 ; 

RA Rice W.R., Burton F.M., Fiedeldey D.T.; 

RT "Cloning and expression of the alveolar type II cell P2u-purinergic 

RT receptor."; 

RL Am. J. Respir. Cell Mol . Biol. 12:27-32(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; TISSUE=Pituitary ; 

RA Chen Z.P., Krull N. , Xu S., Levy A. , Lightman S.L.; 

RL Submitted (AUG-1995) to the EMBL/ GenBank/ DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; 

RX MEDLINE=98099857; PubMed=94 372 11 ; 

RA Seye C.I., Gadeau A. P., Daret D., Dupuch F., Alzieu P., Capron L. , 

RA Desgranges C; 

RT "Overexpression of P2Y2 purinoceptor in intimal lesions of the rat 

RT aorta."; 

RL Arterioscler. Thromb . Vase. Biol. 17:3602-3610(1997). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC The affinity range is UTP = ATP > ATP-gamma-S » 2-methylthio-ATP 

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U09402; AAA61565.1; 

DR EMBL; L46865; AAB02099.1; -. 

DR EMBL; U56839; AAC00048.1; -. 

DR HSSP; P34996; 1DDD. 

DR InterPro; JPR000276; GPCR_Rhodpsn . 



DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Pfam; PF00001; 7tm_l; 1. 

PRINTS; PR00237; GPCRRHODOPSN . 

PROSITE; PS00237; G_PR0TEIN_RECEP_F1__1 ; 1. 

PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

G-protein coupled receptor; Transmembrane; Glycoprotein. 
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DISULFID 
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CONFLICT 

CONFLICT 

SEQUENCE 



1 

33 

60 

71 

94 
111 
130 
153 
173 
195 
221 
246 
269 
287 
309 
9 

13 
106 
132 
143 
166 
188 
374 AA; 



32 

59 

70 

93 
110 
129 
152 
172 
194 
220 
245 
268 
286 
308 
374 
9 

13 
183 
132 
143 
166 
188 

42137 



EXTRACELLULAR (POTENTIAL). 

1 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 



N-LINKED (GLCNAC. . .) 
N-LINKED (GLCNAC. . .) 
BY SIMILARITY. 
C -> S (IN REF. 1) . 
S -> R (IN REF. 1) . 
A -> T (IN REF. 1) . 
A -> D (IN REF. 1) . 
MW; 2DA6853AA2 0A1AF3 CRC64; 



( POTENTIAL) 
(POTENTIAL) 



Query Match 22.7%; Score 374; DB 1; Length 374; 

Best Local Similarity 33.8%; Pred. No. 1.8e-18; 

Matches 96; Conservative 50; Mismatches 124; Indels 



14; Gaps 



7; 



QY 
Db 



19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

|| | I : : I I II : : : I : I I I : I I : I : I : : I I : : I I : I : 

32 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 



Qy 

Db 

Qy 

Db 

Qy 

Db 



7 8 S YA-NDKGT YGDVLCI SNRYVLHTNLYT S ILFLT FI SMDRYLLMKYP FREHFLQKKEFAI 136 

II I : III I ::: I I I I I I I I I I I I : I I : I : I 

92 YYAQGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLSWGHARYAR 151 

137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: : I I I I I : I I : : : I I : : | : | | : I I : I 

152 RVAAVW^LVIACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 

197 SVMCFFYYKMWFLKRRSQQQA TALPLDKPQ — RLWLAWI FS I LFTP YH IMRNLR 251 

| : : | I : I I : | III I : I : I : : I : : I I : I : I I 

212 SIILVCY VLMARRLLKPAYGTTGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLY 267 



Qy 



Db 



252 IASRLDSWPQGC-TQKAIKS I YTLTRPLAFLNSAINPI FYFLMG 294 

: I I II II I : I I I I I M : : I : I I I I 

268 YSFR — SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 309 



^ RESULT 10 
P2Y2_MOUSE 

ID P2Y2 MOUSE STANDARD; PRT; 373 AA. 



AC P35383; 009031; Q9CPZ4; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) ( P2U purinoceptor 1) (P2U1) (ATP receptor) 

DE (Purinergic receptor) . 

GN P2RY2 OR P2RU1. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=93281707; PubMed=7685114 ; 

RA Lustig K.D., Shiau A.K., Brake A. J., Julius D. ; 

RT "Expression cloning of an ATP receptor from mouse neuroblastoma 

RT cells."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:5113-5117(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96316177; PubMed=87 39319 ; 

RA Enomoto K. , Furuya K., Moore R.C., Yamagishi S., Oka T., Maeno T.; 

RT "Expression cloning and signal transduction pathway of P2U receptor 

RT in mammary tumor cells."; 

RL Biol. Signals 5:9-21(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Head, and Liver; 

RX MEDLINE=2 1085660; PubMed=112178 51 ; 

RA Kawai J., Shinagawa A., Shibata K., Yoshino M. , Itoh M. , Ishii Y., 

RA Arakawa T., Hara A., Fukunishi Y . , Konno H., Adachi J., Fukuda S., 

RA Aizawa K. , Izawa M. , Nishi K., Kiyosawa H., Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y. , Gojobori T., Bono H., Kasukawa T., Saito R. , 

RA Kadota K. , Matsuda H.A., Ashburner M. , Batalov S., Casavant T. , 

RA Fleischmann W., Gaasterland T., Gissi C, King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G. , Quackenbush J., 

RA Schriml L.M., Staubli F. , Suzuki R. , Tomita M. , Wagner L. , Washio T . , 

RA Sakai K. , Okido T., Furuno M. , Aono H., Baldarelli R. , Barsh G. , 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C, Fletcher C, Fujita M. , Gariboldi M. , 

RA Gustincich S., Hill D. , Hofmann M. , Hume D . A. , Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L . , Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N. , 

RA Sasaki H., Sato K., Schoenbach C, Seya T . , Shibata Y. , Storch K.-F., 

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C, Whittaker C, Wilming L., 

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 4 09:685-690(2 001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 



RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L. f 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A. , Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [5] 

RP MUTAGENESIS OF LYS-107; ARG-110; HIS-262; ARG-265; LYS-289 AND 

RP ARG-2 92. 

RX MEDLINE=95181393; PubMed=7 87 6172 ; 

RA Erb L., Garrad R.C., Wang Y., Quinn T. , Turner J.T., Weisman G.A.; 

RT "Site-directed mutagenesis of P2U purinoceptors . Positively charged 

RT amino acids in transmembrane helices 6 and 7 affect agonist potency 

RT and specificity."; 

RL J. Biol. Chem. 270:4185-4188(1995). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC The affinity range is UTP = ATP > ATP-gamma-S » 2-methylthio-ATP 

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: SPLEEN, TESTIS, KIDNEY, LIVER, LUNG, HEART AND 
CC BRAIN. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L14751; AAA39871.1; -. 

DR EMBL; S83099; AAB50735.1; -. 

DR EMBL; AK005013; BAB23746.1; -. 

DR EMBL; AK017378; BAB30719.1; -. 

DR EMBL; BC006613; AAH06613.1; 

DR PIR; A47556; A47556. 

DR HSSP; P34996; 1DDD. 

DR MGD; MGI: 105107; P2ry2. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1__1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 32 EXTRACELLULAR (POTENTIAL) . 



FT 


TRANSMEM 


33 


59 


1 (POTENTIAL) . 


FT 


DOMAIN 


60 


70 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


71 


93 


2 (POTENTIAL) . 


FT 


DOMAIN 


94 


110 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


111 


129 


3 (POTENTIAL) . 


FT 


DOMAIN 


130 


152 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


153 


172 


4 (POTENTIAL) . 


FT 


DOMAIN 


173 


194 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


195 


220 


5 (POTENTIAL) . 


FT 


DOMAIN 


221 


246 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


247 


269 


6 (POTENTIAL). 


FT 


DOMAIN 


270 


287 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


288 


309 


7 (POTENTIAL) . 


FT 


DOMAIN 


310 


373 


CYTOPLASMIC (POTENTIAL) . 


FT 


CARBOHYD 


9 


9 


N-LINKED ( GLCNAC . . . ) (POTENTIAL). 


FT 


CARBOHYD 


13 


13 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


DISULFID 


106 


183 


BY SIMILARITY. 


FT 


MUTAGEN 


107 


107 


K->I: NO EFFECT ON RECEPTOR ACTIVATION 


FT 


MUTAGEN 


110 


110 


R->L: NO EFFECT ON RECEPTOR ACTIVATION 


FT 


MUTAGEN 


262 


262 


H->L: DECREASE IN RECEPTOR ACTIVATION. 


FT 


MUTAGEN 


265 


265 


R->L: DECREASE IN RECEPTOR ACTIVATION. 


FT 


MUTAGEN 


289 


289 


K->I: NO EFFECT ON RECEPTOR ACTIVATION 


FT 


MUTAGEN 


292 


292 


R->L: DECREASE IN RECEPTOR ACTIVATION. 


FT 


CONFLICT 


17 


17 


E -> D (IN REF. 2) . 


FT 


CONFLICT 


120 


120 


S -> R (IN REF. 2) . 


FT 


CONFLICT 


125 


125 


T -> N (IN REF. 2) . 


FT 


CONFLICT 


196 


196 


V -> M (IN REF. 2) . 


FT 


CONFLICT 


263 


263 


V -> L (IN REF. 3) . 


FT 


CONFLICT 


355 


355 


D -> N (IN REF. 2) . 


FT 


CONFLICT 


369 


371 


KDI -> PYV (IN REF. 2) . 


SQ 


SEQUENCE 


373 AA; 


42174 


MW; 590BBE502E41B3AE CRC64; 



Query Match 22.6%; Score 373.5; DB 1; Length 373; 

Best Local Similarity 33.5%; Pred. No. 1.9e-18; 

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5; 

Qy 19 KY-YLSAFYAIEFI FGLLGNVTVVFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : I I II : : : I : I M : I I : I : I : : I I : : I I : I : 
Db 32 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

II I : III I ::: I I I I I I II I I I I : I I : I : : I 

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLRWGRARYAR 151 



Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

:: I I I I I I :: I : I I : I I :| 

Db 152 RVAAVVWLVIACQAPVXYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 

Qy 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQ — RLWLAWIFSILFTPYHIMRNLRIAS 254 

II: I I I : : II I : I : I : : I : : I I : I : I I : 

Db 212 SVILVCWLMARRLLKPAYGTTGGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLYYSF 271 



Qy 255 RLDSWPQGC-TQKAI KS I YTLTRPLAFLNSAINPI FYFLMG 294 

I I II II I : I I I II I I : : I : I I I I 

Db 272 R — SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310 



RESULT 11 
P2Y3_MELGA 

ID P2Y3_MELGA STANDARD ; PRT; 32 8 AA. 

AC 093361; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE P2Y purinoceptor 3 (P2Y3) (Nucleoside diphosphate receptor) . 

GN P2RY3. 

OS Meleagris gallopavo (Common turkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Meleagrididae; Meleagris. 

OX NCBI_TaxID=9103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE= 98 4 01046; PubMed= 9730913; 

RA Li Q., OleskyM., Palmer R.K., Harden T.K., Nicholas R.A. ; 

RT "Evidence that the p2y3 receptor is the avian homologue of the 

RT mammalian P2Y6 receptor."; 

RL Mol. Pharmacol. 54:541-546(1998). 

CC -!- FUNCTION: RECEPTOR FOR EXTRACELLULAR UDP> ADP = UTP. THE ACTIVITY 

CC OF THIS RECEPTOR IS MEDIATED BY G PROTEINS WHICH ACTIVATE A 

CC PHOSPHATIDYLINOSITOL-CALCIUM SECOND MESSENGER SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 













DR 


EMBL; AF069555; AAC23863.1; 




DR 


HSSP; P34996; 1DDD. 




DR 


Inter Pro ; 


IPR000276; GPCR Rhodpsn. 


DR 


Pfam; PF00001; 7tm 1; 1. 




DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 


DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


RECEP Fl_l; FALSE_NEG. 


DR 


PROSITE; 


PS50262; 


G PROTEIN" 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Transmembrane; Glycoproteii 


FT 


DOMAIN 


1 


22 


EXTRACELLULAR ( POTENTIAL ) 


FT 


TRANSMEM 


23 


43 


1 (POTENTIAL) . 


FT 


DOMAIN 


44 


57 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


58 


78 


2 (POTENTIAL) . 


FT 


DOMAIN 


79 


96 


EXTRACELLULAR (POTENTIAL) 


FT 


TRANSMEM 


97 


117 


3 (POTENTIAL) . 


FT 


DOMAIN 


118 


139 


CYTOPLASMIC (POTENTIAL). 


FT 


TRANSMEM 


140 


160 


4 (POTENTIAL) . 


FT 


DOMAIN 


161 


189 


EXTRACELLULAR (POTENTIAL) 


FT 


TRANSMEM 


190 


210 


5 (POTENTIAL) . 


FT 


DOMAIN 


211 


231 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


232 


252 


6 ( POTENTIAL) . 


FT 


DOMAIN 


253 


275 


EXTRACELLULAR (POTENTIAL) 


FT 


TRANSMEM 


276 


298 


7 (POTENTIAL) . 


FT 


DOMAIN 


299 


323 


EXTRACELLULAR (POTENTIAL) 



FT 
FT 

SQ 



CARBOHYD 
DISULFID 
SEQUENCE 



5 5 N-LINKED (GLCNAC. . .) (POTENTIAL) . 

94 172 BY SIMILARITY. 

328 AA; 37594 MW; B74D49B99C7164A5 CRC64; 



Query Match 22.5%; Score 371.5; DB 1; Length 328; 

Best Local Similarity 29.4%; Pred. No. 2.3e-18; 

Matches 87; Conservative 61; Mismatches 137; Indels 11; Gaps 4; 

Qy 22 LSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS YA- 80 

I | : : | : | I I | : I : : i : I I : : I : : I : I I : I I : I 

Db 24 LPLWSWFLLGLPLNAWIGQIWLARKALTRTTIYMLNLATADLLYVCSLPLLIYNYTQ 83 

Qy 81 NDKGTYGDVLCI SNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139 

I : I I I I : : I I I : I I I I II I I : I I : : : I I : I : I : 

Db 84 KDYWPFGDFTCKFVRFQFYTNLHGSILFLTCISVQRYMGICHPLASWHKKKGKKLTWLVC 143 

Qy 14 0 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

111:1 : I I I : : : : I I : I : I I : I I I : I : : 

Db 144 AAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSASYFPYGITLTITGFLLPFAAI 2 03 

Qy 200 CFFYYKMWFLKRRSQQQATAL — PLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLD 257 

I I I : : : I : II I : : : : I : : I I I I I : I : : : : I 

Db 204 LACYCSMARI LCQKDELI GLAVHKKKDKAVRMI 1 1 WI VFS I S FFP FHLTKT I YLI VRS S 263 

Qy 258 SWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYRE MLISKFR 306 

I : I _ I I I I I : I I : : I I : : : I I : I I : I 

Db 264 PTLPCPTLQAFAIAYKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKMSSKWR 319 



RESULT 12 
P2Y2_HUMAN 

ID P2Y2_HUMAN STANDARD; PRT; 377 AA. 

AC P41231; Q96EM8; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE P2Y purinoceptor 2 (P2Y2) (P2U purinoceptor 1) (P2U1) (ATP receptor) 

DE (Purinergic receptor). 

GN P2RY2 OR P2RU1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Airway epithelium; 

RX MEDLINE=94211846; PubMed=8 159738 ; 

RA Parr C.E., Sullivan D.M., Paradiso A.M., Lazarowski E.R., Burch L.H., 

RA Olsen J.C., Erb L. , Weisman G. A. , Boucher R.C., Turner J.T.; 

RT "Cloning and expression of a human P2U nucleotide receptor, a target 

RT for cystic fibrosis pharmacotherapy."; 

RL Proc. Natl. Acad. Sci. U.S.A. 91:3275-327 9(1994). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=95108098; PubMed=7 8 09171; 

RA Parr C.E., Sullivan D.M. , Paradiso A.M., Lazarowski E.R., Burch L.H., 

RA Olsen J.C., Erb L . , Weisman G. A. , Boucher R.C., Turner J.T.; 



RT "Cloning and expression of a human P2U nucleotide receptor, a target 

RT for cystic fibrosis pharmacotherapy."; 

RL Proc. Natl. Acad. Sci. U.S.A. 91:13067-13067(1994). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Kidney, and Leukocyte; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K . , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S . , Worley K.C, Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C, Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C, 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC The affinity range is UTP = ATP > ATP-gamma-S » 2-methylthio-ATP 

CC = ADP. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: SPLEEN, TESTIS, KIDNEY, LIVER, LUNG, HEART AND 
CC BRAIN. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way ' 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U07225; AAC04923.1; 

DR EMBL; BC012104; AAH12104.1; -. 

DR EMBL; BC028135; AAH28135.1; -. 

DR HSSP; P34996; 1DDD. 

DR Genew; HGNC 8541; P2RY2 . 

DR MIM; 600041; -. 

DR GO; GO: 0005887; C: integral to plasma membrane; TAS . 

DR GO; GO:0004872; F:receptor activity; TAS. 

DR GO; GO: 0006873; P:cell ion homeostasis; TAS. 

DR GO; GO: 0007200; P:G-protein signaling, coupled to IP3 second . . .; TAS. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 



DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



PROSITE; 

PROSITE; 

G-protein 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

CARBOHYD 

CARBOHYD 

DISULFID 

CONFLICT 

CONFLICT 

CONFLICT 

SEQUENCE 



PS00237; 
PS50262; 
coupled 
1 
33 
60 
71 
94 
111 
130 
153 
173 
195 
221 
247 
270 
288 
310 
9 
13 
106 
312 
350 
359 
377 AA; 



G_PROTEIN_RECEP_Fl_l ; 1 . 
G_PROTEIN_RECEP_Fl_2 ; 1 . 
receptor; Transmembrane; Glycoprotein. 



32 

59 

70 

93 
110 
129 
152 
172 
194 
220 
246 
269 
287 
309 
377 
9 

13 
183 
312 
350 
359 

42289 MW; 



EXTRACELLULAR (POTENTIAL) . 

1 (POTENTIAL) . 
CYTOPLASMIC ( POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR ( POTENTIAL) 

5 (POTENTIAL) ♦ 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 



N-LINKED ( GLCNAC . 
N-LINKED (GLCNAC. 
BY SIMILARITY. 
R -> S (IN REF. 3; 
E -> G (IN REF. 1) 
S -> F (IN REF. 1) 
EE557A857A269AC6 



Query Match 22.0%; Score 363.5; DB 1 

Best Local Similarity 33.5%; Pred. No. 9.1e-18; 
Matches 94; Conservative 52; Mismatches 12 8; 



.) 
-) 



(POTENTIAL) 
(POTENTIAL) 



AAH12104) . 



CRC64; 

Length 377; 
Indels 7; 



Gaps 



5; 



Qy 

Db 



19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : I I I : : : I : I I I : I I : I : I : : I I : : I I : I : 
32 KYVLLPVSYGWCVLGLCLNAVALYIFLCRLKTWNASTTYMFHLAVSDALY7VASLPLLVY 91 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



7 8 S YA-NDKGT YGDVLCI SNRYVLHTNLYTS I LFLTFI SMDRYLLMKYPFREHFLQKKEFAI 136 

II I : III I ::: I I I I I I I II I I I : I I : II : : I 

92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLRSLRWGRARYAR 151 

137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: : I I I I I | : | | : : : I I : : : I I : I I : I 

152 RVAGAVWLVIACQAPV^YFVTTST^GGRVTCHDTSAPELFSRFVAYSSVMLGLLFAVPF 211 

197 S VMC FFY YKMWFLKRRS QQQAT AL P LDKPQ RLWLAW — IFSILFTPYHIMRNLRIAS 254 

: I : I I I : : : II I : : : I I I : I : : I I : I : I I : 
212 AVI LVC YVLMARRLLKPAYGT S GGLP RAKRKS VRT I AWLAVFALCFL P FHVT RT L Y Y S F 271 

255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

I I II II 1:11111 II : : I : I I I I 

272 R— SLDLSCHTLNAINMAYKVTRPLASANSCLDPVLYFLAG 310 



RESULT 13 
P2Y4_RAT 

ID P2Y4_RAT STANDARD; PRT; 361 AA. 

AC 035811; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 



DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 4 (P2Y4) . 

GN P2RY4 OR P2Y4 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=Sprague-Dawley; TISSUE^Liver; 

RA Bogdanov Y.D., Wildman S., King B.F., Burntock G. ; 

RL Submitted (AUG-1997) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN^Sprague-Dawley; TISSUE=Brain; 

RX MEDLINE=984217 85; PubMed=9751165 ; 

RA Webb T.E., Henderson D., Roberts J. A., Barnard E.A. ; 

RT "Molecular cloning and characterization of the rat P2Y4 receptor."; 

RL J. Neurochem. 71:1424-1434(1998). 

CC -!- FUNCTION: Receptor for ATP and UTP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC Not activated by ADP or UDP . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Widely expressed at low levels. In brain, 
CC higher expression in the pineal gland and ventricular system. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC ; 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y14705; CAA75007.1; -. 

DR EMBL; Y11433; CAA7224 1 . 1 ; 

DR HSSP; P34996; 1DDD. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS502 62; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 
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5 (POTENTIAL) . 
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CYTOPLASMIC (POTENTIAL) . 


FT 
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6 (POTENTIAL) . 


FT 


DOMAIN 


266 


283 


EXTRACELLULAR (POTENTIAL 



FT 


TRANSMEM 


284 


305 


7 (POTENTIAL) . 


FT 


DOMAIN 


306 


361 


CYTOPLASMIC (POTENTIAL) . 


FT 


DISULFID 


104 


181 


BY SIMILARITY. 


FT 


CARBOHYD 


175 


175 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


361 AA; 


40893 


MW; 0377F96E54B449A3 CRC64; 



Query Match 21.5%; Score 354; DB 1; Length 361; 

Best Local Similarity 30.6%; Pred. No. 3.8e-17; 

Matches 86; Conservative 60; Mismatches 125; Indels 10; Gaps 5; 



Qy 26 YAI EFI FGLLGNVTVVFGYLFCMKNWNSSN VYLFNLSI SDFAFLCTLPI LI KS Y-ANDKG 84 

I I : I : I I I : : : I I : : I : : : I : I : I : : I I : : : I I I : I I : 
Db 38 YAWFVLGIJUiNAPTLWLFLFRLRPWDATATYMFHIALSDTLYVLSLPTLVYYYAARNHW 97 



Qy 85 TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISLAVWA 144 

: I II I : : : I I I I : I I I I I I : I I I : : I I : I I I : I I I 

Db 98 PFGTGLCKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRAIRWGRPRFASLLCLGVWL 157 



Qy 145 LVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYY 204 

: I : : | | I : : II : I : : I . : : I I : I : I 

Db 158 WAGCLVPNLFFVTTNANGTTILCHDTTLPEEFDHYVYFSSAVMVLLFGLPFLITLVCYG 217 



Qy 205 KMWFLKR RSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDSWPQ 261 

I II : I : : I : I : : : : I : : I I : I I I : : I I 
Db 218 LMARRL YRP L P GAGQ S S S RL RSLRTIAWLTVFAVCFVPFHITRTI YYQARL — LQA 272 

Qy 262 GC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

I : : I : I I I I I I I : : I : I I I I I I 

Db 273 DCHVLNIVNWYKVTRPLASANSCLDPVLYLFTGDKYRNQL 313 



RESULT 14 
C3X1_HUMAN 

ID C3X1_HUMAN STANDARD; PRT; 355 AA. 

AC P49238; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE CX3C chemokine receptor 1 (C-X3-C CKR-1) (CX3CR1) (Fractalkine 

DE receptor) (GPR13) (V28) (Beta chemokine receptor-like 1) (CMK-BRL-1) 

DE (CMKBLR1) . 

GN CX3CR1 OR GPR13 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96011651; PubMed=7590284 ; 

RA Raport C.J., Schweickart V.L., Eddy R.L. Jr., Shows T.B., Gray P.W.; 

RT "The orphan G-protein-coupled receptor-encoding gene V28 is closely 

RT related to genes for chemokine receptors and is expressed in lymphoid 

RT and neural tissues."; 

RL Gene 163:2 95-2 99(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95374679; PubMed=7 646814 ; 



RA Combadiere C, Ahuja S.K., Murphy P.M.; 

RT "Cloning, chromosomal localization, and RNA expression of a human 

RT beta chemokine receptor-like gene."; 

RL DNA Cell Biol. 14:673-680(1995). 

RN [3] 

RP CHARACTERIZATION. 

RX MEDLINE=98050927; PubMed=9390561 ; 

RA Imai T., Hieshima K., Haskell C, Baba M. , Nagira M. , Nishimura M. , 

RA Kakizaki M. , Takagi S., Nomiyama H., Schall T.J., Yoshie O.; 

RT "Identification and molecular characterization of fractalkine receptor 

RT CX3CR1, which mediates both leukocyte migration and adhesion. "; 

RL Cell 91:521-530(1997). 

RN [4] 

RP CHARACTERIZATION. 

RX MEDLINE=98395093; PubMed=9726990 ; 

RA Combadiere C, Salzwedel K. , Smith E.D., Tiffany H.L., Berger E.A. , 

RA Murphy P.M. ; 

RT "Identification of CX3CR1. A chemotactic receptor for the human CX3C 

RT chemokine fractalkine and a fusion coreceptor for HIV-l."; 

RL J. Biol. Chem. 2 73:237 99-238 04(1998). 

RN [5] 

RP VARIANTS ALA-57; ILE-122; ILE-249 AND MET-280. 

RX MEDLINE-2 0196025; PubMed-10731151 ; 

RA Faure S., Meyer L., Costagliola D., Vaneensberghe C, Genin E., 

RA Autran B., Delfraissy J.-F., McDermott D.H., Murphy P.M., Debre P., 

RA Theodorou I., Combadiere C; 

RT "Rapid progression to AIDS in HIV+ individuals with a structural 

RT variant of the chemokine receptor CX3CR1."; 

RL Science 287:2274-2277(2000). 

CC -!- FUNCTION: RECEPTOR FOR THE CX3C CHEMOKINE FRACTALKINE AND MEDIATES 

CC BOTH ITS ADHESIVE AND MIGRATORY FUNCTIONS. ACTS AS CO-RECEPTOR 

CC WITH CD4 FOR HIV-l VIRUS ENVELOPE PROTEIN (IN VITRO) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN LYMPHOID AND NEURAL TISSUES. 

CC -!- DISEASE: INCREASED SUSCEPTIBILITY TO HIV INFECTION AND RAPID 

CC PROGRESSION TO AIDS ARE ASSOCIATED WITH THE I-249/M-280 HAPLOTYPE. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U20350; AAA91783.1; -. 

DR EMBL; U28934; AAA87032.1; -. 

DR PIR; JC4304; JC4304. 

DR Genew; HGNC:2558; CX3CR1. 

DR MIM; 601470; 

DR GO; GO: 0005887; C:integral to plasma membrane; TAS . 

DR GO; GO: 0004950; F: chemokine receptor activity; TAS. 

DR GO; GO: 0007155; P:cell adhesion; TAS. 

DR GO; GO: 0006968; P: cellular defense response; TAS. 

DR GO; GO: 0006935; P : chemotaxis ; TAS. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; TAS. 



DR 


InterPro; 


IPR005387; Fractalkiner . 


DR 


InterPro; 


IPR000276; GPCR Rhodpsn. 


DR 


Pfam; PF00001; 7tm 1; 1. 




DR 


PRINTS; PR01562; 


FRACTALKINER. 


DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 


DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN~ 


_RECEP_Fl_2 ; 1 . 


KW 


G-protein 


coupled 


receptor ; 


Transmembrane; Polymorphism. 
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FT 
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BY SIMILARITY. 


FT 
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57 


57 


T -> A. 


FT 








/FTId=VAR_010041. 


FT 


VARIANT 


122 


122 


V -> I. 


FT 








/FTId-VAR_010042 . 


FT 


VARIANT 


249 


249 


V -> I (COMMON POLYMORPHISM 


FT 








POPULATION) . 


FT 








/FTId=VAR 010043. 


FT 


VARIANT 


280 


280 


T -> M (COMMON POLYMORPHISM 


FT 








POPULATION) . 


FT 








/FTId=VAR 010044. 


SQ 


SEQUENCE 


355 AA; 40396 MW; C59DC5F4C4 312F22 CRC64; 



Query Match 21.4%; Score 353; DB 1; Length 355; 

Best Local Similarity 28.1%; Pred. No. 4.4e-17; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; 



Gaps 



6; 



Qy 

Db 



16 I LNKYYLSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI L 75 

: : I I I I : : I I I : I I : I I I i : : : : i I I I : : I I I : I I I 

28 VFGTVFLSIFYSVIFAIGLVGNLLWFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFW 87 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



7 6 I KS YANDKGT YGDVLCI SNRYVLHTNLYT S ILFLTFI SMDRYLLMKYP FREHFLQKKEFA 135 

I : I I : : : I : I I I : I I I : I I I I : : : 

88 THYLINEKGLH-NAMCKFTTAFFFI GFFGS I FFITVI S I DRYLAIVLAANSMNNRTVQHG 146 

136 ILISIAWALWLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : | | | | : 

147 VT I S LGVWAAAI LVAAPQFMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 

195 PLSVMCFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWI FSILFTPYHIMRNLRIA 253 

I I : I : I : : : : : : : : I I : : I : : I I : : I : : I I I : : I I 

2 03 PLLIMSYCYFRIIQTLFSCKNHKKAKAI KLI LLWI VFFLFWTP YNVMI FLETL 256 

254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREMLISKFRQYFKSL 312 



I • I • I • • • • I -II' 'II' I I • • I I • III 

Db 257 KLYDFFPSCDMRKDLRLALSOTETVAFSHCCLNPLIYAFAGEKFRRYL YHLYGKCL 312 

RESULT 15 
P2Y4_HUMAN 

ID P2Y4_HUMAN STANDARD; PRT; 365 AA. 

AC P51582; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE P2Y purinoceptor 4 (P2Y4) (Uridine nucleotide receptor) (UNR) (P2P) . 

GN P2RY4 OR NRU . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96125055; PubMed=8 537336 ; 

RA Cornmuni D., Pirotton S., Parmentier M. , Boeynaems J.-M.; 

RT "Cloning and functional expression of a human uridine nucleotide 

RT receptor . " ; 

RL J. Biol. Chem. 270:30849-30852(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96125054; PubMed=8 537335 ; 

RA Nguyen T., Erb L., Weisman G.A., Marchese A., Heng H.H.Q., 

RA Garrad R.C., George S.R., Turner J.T., O ' Dowd B.F.; 

RT "Cloning, expression, and chromosomal localization of the human 

RT uridine nucleotide receptor gene."; 

RL J. Biol. Chem. 270:30845-30848(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pancreas ; 

RX MEDLINE-96197801; PubMed-8 617367 ; 

RA Stam N.J., Klomp J., van der Heuvel M. , Olijve W. ; 

RT "Molecular cloning and characterization of a novel orphan receptor 

RT (P2P) expressed in human pancreas that shows high structural homology 

RT to the P2U purinoceptor."; 

RL FEBS Lett. 384:260-264(1996). 

CC -!- FUNCTION: Receptor for UTP and UDP coupled to G-proteins that 
CC activate a phosphatidylinositol-calcium second messenger system. 

CC Not activated by ATP or ADP . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Pancreas. 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; X91852; CAA62963.1; 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



EMBL; U40223; AAC50347.1; -. 
EMBL; X96597; CAA65415.1; -. 
PIR; S68679; S68679. 
HSSP; P34996; 1DDD. 
Genew; HGNC:8542; P2RY4. 
MIM; 300038; -. 

GO; GO:0005887; C:integral to plasma membrane; TAS . 

GO; GO: 0015065; F: uridine nucleotide receptor activity; TAS. 

GO; GO: 0007204; P:cytosolic calcium ion concentration elevation; TAS. 

InterPro; IPR000276; GPCR_Rhodpsn . 

Pfam; PF00001; 7tm_l; 1. 

PRINTS; PR00237; GPCRRHODOPSN . 

PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l; 1. 

PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

G-protein coupled receptor; Transmembrane; Polymorphism. 

EXTRACELLULAR (POTENTIAL) . 

1 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL). 
EXTRACELLULAR ( POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC ( POTENTIAL) . 
BY SIMILARITY. 
V -> M (IN dbSNP: 1152186) . 
/FTId=VAR_011854 . 
N -> T (IN dbSNP: 1152187) . 
/FTId=VAR_011855 . 
P -> L (IN dbSNP: 1152188) . 
/ FT I d=VAR__0 11856. 
L -> V (IN REF. 2) . 
S -> A (IN REF. 2) . 

MW; 23E0AFED3B7BDEED CRC64; 
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FT 








FT 
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FT 








FT 


CONFLICT 


86 


86 


FT 


CONFLICT 


234 


234 


SQ 


SEQUENCE 


365 AA; 


40: 



Query Match 21.3* 
Best Local Similarity 31.1^ 
Matches 90; Conservative 



Score 352; DB 1; Length 365; 
: Pred. No. 5.2e-17; 

59; Mismatches 122; Indels 18; 



Gaps 



6; 



QY 
Db 



22 LSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS Y-A 80 

I I I : I : I I I : : : : | : : | : : : | : | : | : : | | : : : | | | | | | 
38 L P VS YAWFVL G L G LN APT LWL F I F RL R P WD AT AT YM FH LAL S DT L YVL S L P T L I Y Y YAA 97 



QY 
Db 

Qy 

Db 



81 NDKGT YGDVLCI SNRYVLHTNLYT S I LFLT FI SMDRYLLMKYP FREH FLQKKEFAI LI S L 14 0 

: : : I : I I : : : I I I I : I I I I I I : I I I : : I I : I I : I 

98 HNHWPFGTEICKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRALRWGRPRLAGLLCL 157 

141 AVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVMC 200 

I I I : I : : I I I : : I II : I : : I : I I : I I 

158 AWLVVAGCLVPNLFFVTTSNKGTTVLCHDTTRPEEFDHYVHFSSAVHGLLFGVPCLVTL 217 



Qy 201 FFYYKMWFLKRR SQQQATALPLDKPQRLWLAWI FS I LFTPYHIMRNLRIA 253 

I I II I I : : I : I : : : : | : : I I : I I I : 
Db 218 VCYGLMA RRLYQPLPGSAQSSSRL RS LRT I AWLTVFAVC FVPFHI T RT I YYL 270 

Qy 254 SRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

: I I I : : I : i I I I I I I : : I : I I I I I I I 

Db 271 ARL — LE7VDCRVLNIVNWYKVTRPLAS7^SCLDPVLYLLTGDKYRRQL 317 



Search completed: December 12, 2003, 18:07:3£ 
Job time : 19 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 



December 12, 2003, 18:05:17 ; Search time 20 Seconds 

(without alignments) 
1524.274 Million cell updates/sec 

US-09-891-138A-2 
1650 

1 MAQNLSCENWLATEAILNKY REMLI S KFRQ YFKS LT S FRT 317 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database 



PIR_76: * 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


1 


474 


28.7 


373 


2 


JC4162 


2 


474 


28.7 


373 


2 


JC4737 


3 


472.5 


28.6 


362 


2 


S33733 


4 


373.5 


22.6 


373 


2 


A47556 


5 


353 


21.4 


355 


2 


JC4304 


6 


352 


21.3 


365 


2 


S68679 


7 


336.5 


20.4 


360 


2 


A57160 


8 


336 


20.4 


328 


2 


155450 


9 


335.5 


20.3 


375 


2 


A54946 


10 


333 


20.2 


308 


2 


150241 


11 


332.5 


20.2 


354 


2 


158186 


12 


332 


20.1 


328 


2 


JC4800 


13 


328 


19.9 


355 


2 


A45177 



Description 



P2Y receptor - bov 
G protein-coupled 
G protein-coupled 
ATP receptor P2u - 
orphan G protein-c 
G protein-coupled 
chemokine (C-C) re 
G protein-coupled 
P-2U nucleotide re 
G protein-coupled 
probable G protein 
P2Y6 receptor - hu 
chemokine (C-C) re 



14 


326.5 


19. 


8 


360 


2 


JC4587 


chemokine (C-C) re 


15 


325.5 


19. 


7 


388 


2 


JN0605 


somatostatin recep 


16 


318.5 


19. 


3 


359 


2 


A48921 


interleukin-8 rece 


17 


317.5 


19. 


2 


355 


2 


A55733 


G protein-coupled 


18 


316.5 


19. 


2 


384 


2 


A47249 


brain-specific som 


19 


315.5 


19. 


1 


355 


2 


JQ1231 


interleukin-8 rece 


20 


315 


19. 


1 


355 


2 


JC5067 


G protein-coupled 


21 


314 


19. 


0 


369 


2 


JC2083 


somatostatin recep 


22 


314 


19. 


0 


369 


2 


B41795 


somatostatin recep 


23 


314 


19. 


0 


370 


2 


JC5549 


heptahelical P2Y5- 


24 


313 


19. 


0 


344 


2 


T09508 


intron 17 purinerg 


25 


312.5 


18. 


9 


360 


2 


A53611 


interleukin-8 rece 


26 


311 


18. 


8 


358 


2 


A53752 


interleukin-8 rece 


27 


309.5 


18. 


8 


384 


2 


JC4 62 9 


somatostatin recep 


28 


309 


18. 


7 


355 


2 


149339 


macrophage inflamm 


29 


308 


18. 


7 


369 


2 


D41795 


somatostatin recep 


30 


307.5 


18. 


6 


369 


2 


A45291 


somatostatin recep 


31 


306 


18. 


5 


356 


2 


149340 


MIP-1 alpha recep t 


32 


306 


18. 


5 


362 


2 


A39714 


G protein-coupled 


33 


306 


18. 


5 


391 


2 


A39297 


somatostatin recep 


34 


305 


18. 


5 


346 


2 


S29248 


somatostatin recep 


35 


304.5 


18. 


5 


361 


2 


B45680 


G protein-coupled 


36 


303 


18. 


4 


350 


2 


A39445 


interleukin-8 rece 


37 


303 


18. 


4 


391 


2 


A41795 


somatostatin recep 


38 


303 


18. 


4 


391 


2 


C41795 


somatostatin recep 


39 


302 


18. 


3 


366 


1 


OORTB2 


bradykinin recepto 


40 


301.5 


18. 


3 


369 


2 


JC5068 


G protein-coupled 


41 


301 


18. 


2 


355 


2 


G02436 


chemokine (C-C) re 


42 


299.5 


18. 


2 


353 


2 


A53858 


bradykinin recepto 


43 


298 


18. 


1 


354 


2 


T09353 


G protein-coupled 


44 


297 


18. 


0 


387 


2 


169202 


G protein-coupled 


45 


295.5 


17. 


9 


363 


2 


157955 


somatostatin recep 



ALIGNMENTS 



RESULT 1 
JC4162 

P2Y receptor - bovine 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 12-Oct-1995 #sequence_revision 10-Nov-1995 #text_change 24-Sep-1999 
C; Accession: JC4162 

R;Henderson, D.J.; Elliot, D.G.; Smith, G.M. ; Webb, T.E.; Dainty, I. A. 

Biochem. Biophys. Res. Commun. 212, 648-656, 1995 

A; Title: Cloning and characterisation of a bovine P2Y receptor. 

A; Reference number: JC4162; MUID: 95352058 ; PMID: 7626079 

A; Accession: JC4162 

A;Molecule type: mRNA 

A; Residues: 1-373 <HEN> 

A;Cross-references: EMBL:X87628; NID : gl032484 ; PIDN : CAA60958 . 1 ; PID:gl032485 

A; Experimental source: aortic endothelial cell 

C; Genetics : 

A; Gene: bovp2y 

C; Superf amily : ATP receptor P2u 

C;Keywords: glycoprotein; phosphoprotein; receptor; transmembrane protein 
F; 52-77/Domain: transmembrane tstatus predicted <TM1> 



F; 8 8- Ill/Domain: transmembrane ((status predicted <TM2> 
F; 12 4-150/Domain : transmembrane ((status predicted <TM3> 
F; 171-191/Domain : transmembrane #status predicted <TM4> 
F; 2 14-2 37 /Domain : transmembrane ((status predicted <TM5> 
F; 261-282/Domain : transmembrane #status predicted <TM6> 
F; 3 05-32 8 /Domain : transmembrane ((status predicted <TM7> 

F; 11, 27, 113, 197/Binding site: carbohydrate (Asn) (covalent) ((status predicted 
F; 258/Binding site: phosphate (Ser) (covalent) (by protein kinase A) ((status 
predicted 

Query Match 28.7%; Score 474; DB 2; Length 373; 

Best Local Similarity 36.6%; Pred. No. 4e-32; 

Matches 105; Conservative 58; Mismatches 116; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

I I I I I : I I I I I I : : : : | | | | : : I I : I I I : : : I I : : I I I I I I 
Db 52 YYLPAWILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVLTLPALIFYY 111 

Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I : : I I I I I I I II I I I II : I I ■ : I I : I : I 

Db 112 FNKTDWI FGDAMCKLQRFI FHVNLYGS I LFLTCI SAHRYSGWYPLKSLGRLKKICNAVYI 171 

Qy 139 SLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : : I : II I I I I : I I : I : I I 

Db 172 SVLVWLIVWGISPILFYSGTGIRKNKTITCYDTTSDEYLRSYFIYSMCTTVAMFCVPLV 231 

Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

: : I : I I : : II I I I : : : : I : : : I : I : I : : : : I I 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKS I YLVI I VLTVFAVS YI P FHVMKTMNLRARL 288 

Qy 2 5 7 D- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML * 301 

I I : I : : I : I I I I I I I : : I I M I I I : I I 

Db 289 DFQTPEMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRRL 335 



RESULT 2 
JC4737 

G protein-coupled receptor P2Y1 - human 

N;Alternate names: P2Y1 purinergic receptor; P2Y1 purinoceptor 
C; Species: Homo sapiens (man) 

C;Date: 10-May-1996 #sequence_revision 16-Aug-1996 #text_change 17-Nov-2000 
C;Accession: JC4737; JC4615; S54253 

R; Janssens, R. ; Communi, D.; Pirotton, S.; Samson, M. ; Parmentier, M. ; 
Boeynaems, J.M. 

Biochem. Biophys . Res. Commun. 221, 588-593, 1996 

A; Title: Cloning and tissue distribution of the human P2Y1 receptor. 
A; Reference number: JC4737; MUID: 96205320; PMID: 8630005 
A; Accession: JC4737 
A; Molecule type: DNA 
A; Residues: 1-373 <JAN> 

A; Cross-references: GB:S81950; NID : gl839438 ; PIDN: AAB47091 . 1 ; PID:gl839439 
R;Ayyanathan, K.; Webbs, T.E.; Sandhu, A.K.; Athwal, R.S.; Barnard, E.A.; 
Kunapuli, S.P. 

Biochem. Biophys. Res. Commun. 218, 783-788, 1996 

A; Title: Cloning and chromosomal localization of the human P2Y1 purinoceptor. 
A; Reference number: JC4615; MUID: 96158962 ; PMID:8579591 
A; Accession: JC4615 



A;Molecule type: mRNA 
A; Residues: 1-373 <AYY> 

A; Cross-references: GB:U42029; NID : gl 1 47730 ; PIDN : AAA97 872 . 1 ; PID:gll47731 

A; Experimental source: erythro leukemia cells 

R;Leon, C. ; Vial, C; Cazenave, J.; Gachet, C. 

submitted to the EMBL Data Library, May 1995 

A; Description : Cloning of a human putative P2Y receptor. 

A; Reference number: S54253 

A;Accession: S54253 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-137,139-373 <LE0> 

A; Cross-references : EMBL : Z 4 92 05; NID:g798835; PIDN : CAA89066 . 1 ; PID:g798836 

C; Comment: This receptor belongs to a family of G protein-coupled receptors. It 

responds to both ADP and ATP, and has several serine/ threonine phosphorylation 

residues in the carboxyl terminus. 

C; Genetics: 

A; Gene: p2Yl; GDB : P2RY1 

A; Cross-references: GDB: 677125; OMIM: 601167 

A; Map position: 3pter-3qter 

C; Superf amily : ATP receptor P2u 

C; Keywords: G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 

F; 52-7 7 /Domain: transmembrane #status predicted <TM1> 
F; 8 8-111/Domain : transmembrane #status predicted <TM2> 
F; 124-152/Domain: transmembrane ftstatus predicted <TM3> 
F; 171-191/Domain : transmembrane #status predicted <TM4> 
F; 214-237/Domain: transmembrane (fstatus predicted <TM5> 
F; 261-282/Domain : transmembrane #status predicted <TM6> 
F; 305-328/Domain : transmembrane #status predicted <TM7> 

F; 11 , 27 , 113, 197/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 258, 336/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #statu 
predicted 

F; 330, 339/Binding site: phosphate (Thr) (covalent) (by protein kinase C) ({statu 
predicted 

F; 343/Binding site: phosphate (Ser) (covalent) (by protein kinase C and 
calmodulin-dependent kinase) ((status predicted 

Query Match 28.7%; Score 474; DB 2; Length 373; 

Best Local Similarity 37.3%; Pred. No. 4e-32; 

Matches 107; Conservative 55; Mismatches 117; Indels 8; Gaps 6; 

Qy 20 YYLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSY 79 

11111:111111 : : : : I I I I : : | | : | | | : : : | | : : | || I I I 
Db 52 YYLPAWILVFIIGFLGNSVAIWMFVFHMKPWSGISVYTytFNIJ^IADFLYVLTLPALIFYY 111 



Qy 80 ANDKG-TYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILI 138 

I : I I : I I :: I I I I I I I I I I I I II : I i : I I : I I I 

Db 112 FNKTDWIFGDAMCKLQRFIFHWLYGSILFLTCISAHRYSGWYPLKSLGRLKKKNAICI 171 

Qy 139 SLAWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPEHNLI YSLCLTLLGFLIPLS 197 

I : I I : I : : I : I : : I I : II I I I I : I I : I : I I 

Db 172 SVLWLI VVVAISPILFYSGTGVRKNKTITCYDTTSDEYLRSYFIYSMCTTV7\MFCVPLV 231 



Qy 198 VMCFFYYKMWFLKRRSQQQATALPL-DKPQRLWLAWIFSILFTPYHIMRNLRIASRL 256 

: : I : I I : : II I I I : : : : I : : : I : I : I : : : : I I 
Db 232 LILGCYGLIVRALIYKDLDNS PLRRKSI YLVIIVLTVFAVSYI PFHVMKTMNLRARL 288 



Qy 257 D- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINPI FYFLMGDHYREML 301 

I I I : : I : I I I I I I I : : I I I I I I I : I I 

Db 289 D FQT PAMCAFN DRVYAT YQVT RGLAS LN S CVD P I L YFLAGDT FRRRL 335 

RESULT 3 
S33733 

G protein-coupled receptor ~ chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 24-Sep-1999 
C;Accession: S33733 

R;Webb, T.E.; Simon, J.; Krishek, B.J.; Bateson, A.N.; Smart, T.G.; King, B.F.; 
Burnstock, G. ; Barnard, E.A. 
FEBS Lett. 324, 219-225, 1993 

A; Title: Cloning and functional expression of a brain G-protein-coupled ATP 
receptor . 

A; Reference number: S33733; MUID: 93285340; PMID: 8508924 
A;Accession: S33733 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-362 <WEB> 

A; Cross-references: EMBL:X73268; NID:g395084; PIDN : CAA517 16 . 1 ; PID:g395085 
C; Super family : ATP receptor P2u 

C; Keywords: G protein-coupled receptor; transmembrane protein 

1 

Query Match 28.6%; Score 412.5; DB 2; Length 362; 

Best Local Similarity 35.9%; Pred. No. 5.2e-32; 

Matches 110; Conservative 58; Mismatches 121; Indels 17; Gaps 8; 

Qy 10 WLA TEAILNK YYLSAFYAI EFI FGLLGNVT VVFGYLFCMKNWNS SNVYLFN 60 

II I : II III I : I I I I I I : : : : | | : | : : | | : | | 

Db 22 WAAGNATTKCSLTKTGFQFYYLPTWILVFITGFLGNSVAIWMFVFHMRPWSGISVYMFN 81 

Qy 61 LSI SDFAFLCTLPI LI KS YANDKG-T YGDVLCI SNRYVLHTNLYTS I LFLTFI SMDRYLL 119 

I : : : I I : : I I I I I II : I I I : I I : : I I I I I II I I I I I : I I 

Db 82 LALADFLYVLTLPALI FYYFNKTDWI FGDVMCKLQRFI FHVNLYGS ILFLTCI SVHRYTG 141 

Qy 120 MKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINS-VPKEEGSNCIDYASSGNPE 17 8 

: : I : I I : I : : I I I I I I : I : I : : I : : II : 

Db 142 VVliPLKS LGRLKKKNAVYVS S LVWALWAVI AP I L FYS GTGVRRNKT I TC YDTTADE YLR 201 

Qy 179 HNLIYSLCLTLLGFLI PLSVMCFFYYKMWFLKRRSQQQATALPL-DKPQRLVA/XAVVI F 237 

: I I : I I : III I : I : I I : : II I I I : : : : I 
Db 202 SYFVYSMCTTVFMFCIPFIVILGCYGLIVKALIYKDLDNS PLRRKS I YLVII VLTVF 258 

Qy 238 SI LFTPYHIMRNLRIASRLD- SWPQGCT-QKAI KS I YTLTRPLAFLNSAINPI FYFLMGD 295 

: : : I : I : I : I : : I I I III : : I : I I I I II I : : I I I I I I I 

Db 259 AVS YLP FHVMKTLNLRAJRLD FQT PQMCAFNDKVYAT YQVT RGLAS LNS CVDP I LYFLAGD 318 

Qy 296 HYREML 301 

: I I 

Db 319 T FRRRL 324 



RESULT 4 
A47556 



ATP receptor P2u - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 13-Jan-1995 #sequence_revision 13-Jan-1995 #text_change 24-Sep-1999 
C; Access ion: A4 7556 

R;Lustig, K.D.; Shiau, A.K.; Brake, A. J.; Julius, D. 
Proc. Natl. Acad. Sci. U.S.A. 90, 5113-5117, 1993 

A;Title: Expression cloning of an ATP receptor from mouse neuroblastoma cells. 

A;Reference number: A47556; MUID : 932 8 1707 ; PMID:7685114 

A;Accession: A47556 

A; Status: preliminary 

A;Molecuie type: mRNA 

A; Residues: 1-373 <LUS> 

A;Cross-references : GB:L14751; NID:g309457; PIDN : AAA398 71 . 1 ; PID:g309458 
C; Superf amily : ATP receptor P2u 
C; Keywords: transmembrane protein 

Query Match 22.6%; Score 373.5; DB 2; Length 373; 

Best Local Similarity 33.5%; Pred. No. 9.8e-24; 

Matches 94; Conservative 50; Mismatches 130; Indels 7; Gaps 5; 

Qy 19 KY-YLSAFYAIEFIFGLLGNWWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : I I II : : : I : I I I : I I : I : I : : I I : : I I : I : 
Db 32 KYVLLPVSYGWCVLGLCLNWALYIFLCRLKTWNASTTYMFHLAVSDSLYAASLPLLVY 91 



Qy 7 8 S YA-NDKGT YGDVLCI SNRYVLHTNL YT S I L FLT FI SMDRYLLMKYP FREH FLQKKEFAI 136 

II I : III I : : : M I I I I I I ! I I I : I i : I : : I 

Db 92 YY7VRGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLHSLRWGRARYAR 151 



Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

:: I I I I I : I I : : : II:: I : I I : I I : I 

Db 152 RVAAVWVLVLACQAPVLYFVTTSVRGTRITCHDTSARELFSHFVAYSSVMLGLLFAVPF 211 



Qy 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQ — RLWLAWI FS I LFTP YHIMRNLRI AS 254 

I ! : I I I : : II I : I : I : : I : : I I : I : I I : 

Db 212 SVILVCYVLMARRLLKPAYGTTGGLPRAKRKSVRTIALVLAVFALCFLPFHVTRTLYYSF 271 



Qy 255 RLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMG 294 

I I II II I : I I I I I I I : : I : I I I I 

Db 272 R — SLDLSCHTLNAINMAYKITRPLASANSCLDPVLYFLAG 310 



RESULT 5 
JC4304 

orphan G protein-coupled receptor - human 
N;Alternate names: V28 protein 
C; Species: Homo sapiens (man) 

C;Date: 16-Nov-1995 #sequence_revision 08-Feb-1996 #text_change 19-May-2000 
C; Accession: JC4304 

R;Raport, C.J.; Schweickart, V.L.; Eddy Jr., R.L.; Shows, T.B.; Gray, P.W. 
Gene 163, 295-299, 1995 

A;Title: The orphan G-protein-coupled receptor-encoding gene V2 8 is closely 
related to genes for chemokine receptors and is expressed in lymphoid and 
neuraltissues . 

A;Reference number: JC4304; MUID : 96011651 ; PMID:7590284 
A;Accession: JC4304 
A;Molecule type: mRNA 
A; Residues: 1-355 <RAP> 



A; Cross-references : GB:U20350; NID:g665580; PIDN : AAA91783 . 1 ; PID:g665581 
A; Experimental source: peripheral blood mononuclear cell 
C; Comment: This protein is a cell-surface receptor which recognizes 
extracellular signals and transduces those signals into an intracellular 
response . 

C; Comment: This protein is a key regulator of many immune and homeostatic 
responses, and interacts between the nervous and immune systems. 
C; Genetics : 
A; Gene: v28 

A;Map position: 3pter-p21 

C; Superf amily : vertebrate rhodopsin 

C; Keywords: G protein-coupled receptor; lymphokine; transmembrane protein 

F; 35-57 /Domain : transmembrane #status predicted <TM1> 

F; 66-88/Domain : transmembrane ((status predicted <TM2> 

F; 104-125/Domain : transmembrane #status predicted <TM3> 

F; 14 6- 165 /Domain : transmembrane #status predicted <TM4> 

F; 197-217/Domain : transmembrane #status predicted <TM5> 

F; 230-254/Domain : transmembrane #status predicted <TM6> 

F; 275-2 9 6/ Domain : transmembrane #status predicted <TM7> 

Query Match 21.4%; Score 353; DB 2; Length 355; 

Best Local Similarity 28.1%; Pred. No. 4.8e-22; 

Matches 84; Conservative 66; Mismatches 133; Indels 16; Gaps 6; 

Qy 16 I LNKYYLSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI L 75 

: : I I I I : : I I I . : I I : I I I I : : : : I I I I : : I I hill 

Db 28 VFGTVFLS I FYSVI FAI GLVGNLLVVFALTNSKKPKSVTDI YLLNLALS DLLFVATLPFW 87 



Qy 76 IKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFA 135 

I : I I : : : I : I I I : I I I : I I I I : : : 

Db 88 THYLINEKGLH-NAMCKFTTAFFFIGFFGSI FFITVI S I DRYLAIVLAANSMNNRTVQHG 146 



Qy 136 ILISLAVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLI 194 

: I I I I I I I I I | : : : | : | | : : : : MM: 

Db 147 VT I S LGVWAAAI LVAAPQ FMF TKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLL 202 



Qy 195 PLSVMCFFYYfOytV-VFLKRRSQQQATALPLDKPQRLWLAVVIFSILFTPYHIMRNLRIA 253 

I I : I : I : : : : : : : : | | : : I : : I I : : I : : I I I : : I I 

Db 203 PLLIMSYCYFRIIQTLFSCKNHKKAKAI KLILLWIVFFLFWTPYNVMIFLETL 256 



Qy 254 SRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINP1FYFLMGDHYREMLISKFRQYFKSL 312 

1:1 : I : : : : I : I I : : I I : I I : : I I : ill 

Db 257 KLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLI YAFAGEKFRRYL YHLYGKCL 312 



RESULT 6 
S68679 

G protein-coupled receptor - human 
C; Species: Homo sapiens (man) 

C;Date: 15-Feb-1997 #sequence_revision 13-Mar-1997 #text__change 20-Jun-2000 
C; Accession: S 68 67 9 

R;Stam, N.J.; Klomp, J.; van de Heuvel, M. ; Olijve, W. 
FEBS Lett. 384, 260-264, 1996 

A; Title: Molecular cloning and characterization of a novel orphan receptor 
(P(2P)) expressed in human pancreas that shows high structural homology to the 
P(2U) purinoceptor . 

A;Reference number: S68679; MUID : 96197801 ; PMID:8617367 



A; Accession : S 68 67 9 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-365 <STA> 

A;Cross-references: EMBL : X96597 ; NID : gl296631 ; PIDN : CAA65415 . 1 ; PID:gl296632 

C;Superfamily: ATP receptor P2u 

C; Keywords: G protein-coupled receptor 

Query Match 21.3%; Score 352; DB 2; Length 365; 

Best Local Similarity 31.1%; Pred. No. 6e-22; 

Matches 90; Conservative 59; Mismatches 122; Indels 18; Gaps 6; 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNWLFNLSISDFAFLCTLPILIKSY-A 80 

I | | : | : | | | : : : : | : : | : : : | : | : | : : | | : : : | | | | | | 
Db 38 LPVSYAWFVLGLGLNAPTLWLFIFRLRPWDATATYMFHLALSDTLYVLSLPTLIYYYAA 97 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISL 140 

: : : I : I I : : : I I I I : I I I I I I - : I I I : : I I : I I : I 

Db 98 HNHWPFGTEICKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRALRWGRPRLAGLLCL 157 

Qy 141 AVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVMC 200 

I I I : I : : I I I : : I II : I : : I : I I : I I 

Db 158 AWLWAGCLVPNLFFVTTSNKGTTVLCHDTTRPEEFDHYVHFSSAVMGLLFGVPCLVTL 217 



Qy 2 01 FFYYKMWFLKRR SQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIA 253 

II II I I : : I : I : : : : | : : | | : | | | : 
Db 218 VCYGLMA RRLYQPLPGSAQSSSRL RS LRT I AWLTVFAVC FVP FH I TRT I YYL 270 



Qy 254 SRLDSWPQGC-TQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYREML 301 

: : I I : : ! : I I I I I I I :: I : I I I I I I I 

Db 271 ARL — LEADCRVLNIVNWYKVTRPLASANSCLDPVLYLLTGDKYRRQL 317 



RESULT 7 
A57160 

chemokine (C-C) receptor 4 - human 
N; Alternate names: C-C CKR-4 
C; Species: Homo sapiens (man) 

C;Date: 10-Nov-1995 #sequence_revision 10-Nov-1995 #text_change 21-Jul-2000 
C;Accession: A57160 

R; Power, C.A.; Meyer, A.; Nemeth, K. ; Bacon, K.B.; Hoogewerf, A.J.; Proudfoot, 
A.E.I. ; Wells, T.N.C. 

J. Biol. Chem. 270, 19495-19500, 1995 

A; Title: Molecular cloning and functional expression of a novel CC chemokine 

receptor cDNA from a human basophilic cell line. 

A; Reference number: A57160; MUID: 95370289; PMID: 7642634 

A;Accession: A57160 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-360 <POW> 

A; Cross-references: GB:X85740; NID : gl37 0103 ; PIDN : CAA59743 . 1 ; PID:g971452 

A; Note : source clone K5-5 

C; Genetics : 

A; Gene : GDB : CMKBR4 

A; Cross-ref erences : GDB: 677463 

A;Map position: 3p21-3p21 

C; Superf amily : vertebrate rhodopsin 



C; Keywords: G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 

F; 40-65/Domain : transmembrane ((status predicted <TM1> 
F; 76-97/Domain : transmembrane #status predicted <TM2> 
F; 112-133/Domain: transmembrane ((status predicted <TM3> 
F; 15 1-175 /Domain : transmembrane #status predicted <TM4> 
F; 208-226/Domain; transmembrane ((status predicted <TM5> 
F; 243-264/Domain: transmembrane ((status predicted <TM6> 
F;291-308/Domain: transmembrane ((status predicted <TM7> 
F;29-276, 1 10-187/Disulf ide bonds: ((status predicted 

F; 72 , 3 50/ Binding site: phosphate (Ser) (covalent) (by casein kinase II) ((status 
predicted 

F; 145/Binding site: phosphate (Ser) (covalent) (by protein kinase C) ((status 
predicted 

F; 183, 194/Binding site: carbohydrate (Asn) (covalent) ((status predicted 

F; 321/Binding site: phosphate (Thr) (covalent) (by protein kinase C) ((status 

predicted 

Query Match 20.4%; Score 336.5; DB 2; Length 360; 

Best Local Similarity 29.4%; Pred. No. 1.2e-20; 

Matches 93; Conservative 60; Mismatches 132; Indels 31; Gaps 9; 

Qy 14 EAILNKYYL SAF YAI EFI FGLLGN VTWFGYLFCMKNWN 52 

I : I : I I I II I :: I : I I I I I I I I III 

Db 13 ESIYSNYYLYESIPKPCTKEGIKAFGELFLPPLYSLVFVFGLLGNSVWL-VLFKYKRLR 71 

Qy 53 S-SNVYLFNLSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRYVLHTNLYTSILFLTF 111 

I :: I I I I I : I I I I : : I I II I : : I II : : I : I I : 

Db 72 SMTDWLLNLAISDLLFVFSLPFW-GYYAADQWVFGLGLCKMI SWMYLVGFYSGIFFVML 130 

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDY 171 

: I : I I I I : : : : : : I M I : : II I : : I 

Db 131 MSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVFASLPGFLFSTCYTERNHTYCKTK 190 

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLW 231 

I : : : I I : : I I : I I I : I III:: I : I : ! : : : 

Db 191 YSLNSTTWKVLSSLEINILGLVIPLGIM-LFCYSMII RTLQHCKNEKKNKAVKMIF 245 

Qy 232 LAWIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 290 

I I : I : I I I : I : I I : III::: I | | | : : : I I I I 

Db 24 6 AWVLFLGFWTPYNIVLFLETLVELEVL-QDCTFERYLDYAIQATETLAFVHCCLNPIIY 304 

Qy 291 FLMGDHYREMLI SKFR 306 

I : I : : I : : : I : 
Db 305 FFLGEKFRKYILQLFK 320 



RESULT 8 
155450 

G protein-coupled P2 receptor - rat 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 02-Jul-1996 ffsequence_revision 02-Jul-1996 #text_change 19-May~2000 
C;Accession: 155450 

R;Chang, K. ; Hanaoka, K.; Kumada, M. ; Takuwa, Y. 
J. Biol. Chem. 270, 26152-26158, 1995 

A;Title: Molecular cloning and functional analysis of a novel P2 nucleotide 
receptor. 



A;Reference number: 155450; MUID: 96064682 ; PMID:7592819 
A; Accession: 155450 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-328 <RES> 

A;Cross-references: GB:D63665; NID : gl066007 ; PIDN : BAA09816 . 1 ; PID:gl066008 

C; Superf amily : ATP receptor P2u 

C; Keywords: G protein-coupled receptor 

Query Match 20.4%; Score 336; DB 2; Length 328; 

Best Local Similarity 29.9%; Pred. No. 1.2e-20; 

Matches 85; Conservative 52; Mismatches 137; Indels 10; Gaps 4; 

Qy 22 LSAFYAI EFI FGLLGNWVVFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS YA- 80 

I I : : : I I III: : III I I : : : I : I : I I : I I : I I 

Db 29 LPPVYSVVXWGLPLNVCVIAQICASRRTLTRSAWTLNLALADLLYACSLPLLIYNYAR 8 8 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139 

I : I I : I I :: : I I : I I I I I I I I I I I : : I I : I :: 

Db 8 9 GDHWPFGDLACRLVRFLFYANLHGSILFLTCISFQRYLGICHPLAPWHKRGGRRAAWWC 14 8 

Qy 140 LAWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLI YSLCLTLLGFLIPLSVM 199 

I I : I I : I I I : : : I I : I I : I I : : I I I : I : : 

Db 14 9 GVWLVVTAQCLPTAVFAATGIQRNRTVCYDLSPPILSTRYLPYGM7VLTVIGFLLPFTAL 2 08 

Qy 200 C FFY YKMWFLKRRS QQQATAL P L DKPQRLWLAWIFSILFTPYHIMRNLRIAS 254 

1:1 : I : I I I : I I : I : : I I I I : I I : : I 

Db 209 LACYCRMA RRLCRQDGPAGPVAQERRSKAARMAVVVAAVFVI S FLPFHITKTAYLAV 2 65 

Qy 255 RLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYR 298 

I : : | | | | | M :: I I :: : I 

Db 266 RSTPGVSCPVLETFAAAYKGTRPFASANSVLDPILFYFTQQKFR 309 



RESULT 9 
A54946 

P-2U nucleotide receptor - human 
C; Species: Homo sapiens (man) 

C;Date: ll-Nov-1994 #sequence_revision ll-Nov-1994 #text_change 17-Mar-1999 
C; Access ion: A54 94 6 

R;Parr, C.E.; Sullivan, D.M.; Paradiso, A.M.; Lazarowski, E.R.; Burch, L.H.; 
Olsen, J.C.; Erb, L. ; Weisman, G.A. ; Boucher, R.C.; Turner, J.T. 
Proc. Natl. Acad. Sci. U.S.A. 91, 3275-3279, 1994 

A;Title: Cloning and expression of a human P-2U nucleotide receptor, a target 
for cystic fibrosis pharmacotherapy. 

A; Reference number: A54946; MUID: 94211846; PMID: 8159738 

A; Accession: A54 94 6 

A; Status: preliminary 

A;Molecule type : mRNA; protein 

A; Residues: 1-375 <PAR> 

A; Cross-ref erences : GB:U07225 

A; Note: parts of this sequence were confirmed by protein sequencing 
C; Genetics : 

A;Gene: GDB : P2RY2 ; HP2U; P2U 

A;Cross-references : GDB: 362713; OMIM: 600041 
A; Map position: llql3 . 5-1 lql4 . 1 
C; Superf amily : ATP receptor P2u 



C;Keywords: G protein-coupled receptor; transmembrane protein 



Query Match 20.3%; Score 335.5; DB 2; Length 375; 

Best Local Similarity 33.1%; Pred. No. 1.5e-20; 

Matches 93; Conservative 52; Mismatches 127; Indels 9; Gaps 7; 

Qy 19 KY-YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

II I I : : I I I : : : I : I I I : I I : I : I : : I I : : I I : I : 
Db 32 KYVLLPVSYGWCVLGLCLNAVGLYIFLCRLKTWNASTTYMFHLAVSDALYAASLPLLVY 91 

Qy 78 SYA-NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAI 136 

II I : III I ::: I I I I II I I I I I I : I I : II : : I 

Db 92 YYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLRSLRWGRARYAR 151 

Qy 137 LISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPL 196 

: : I I I I I I : I I : : : I I : : : I I : I I : I 

Db 152 RVAGAVWLVLACQAPVLYFVTTSARGP-LTCHDTSAPELFSRFVAYSSVMLGLLFAVPF 210 

Qy 197 SVMCFFYYKMWFLKRRSQQQATALPLDKPQRLWLAW — IFSILFTPYHIMRNLRIAS 254 

: I : I ||:: : II I : : : I I I : I : : I I : I : I I ': 
Db 211 AVILVCYVLMARRLLKPAYGTSGGLPRAKRKSVRTIAWLAVFALCFLPFHVTRTLYYSF 270 

Qy 255 RLDSWPQGC-TQKAI KS I YTLTRPLAFLNSAINPI FYFLMG 2 94 

I I II II I : I I I I I I : : I : I I I I 

Db 271 R— SLDLSCHTLNAINMAYKVTR-LASANSCLDPVLYFLAG 308 



RESULT 10 
150241 

G protein-coupled receptor 6H1 - chicken 
N;Alternate names: purinoceptor 6H1 
C; Species: Gallus gallus (chicken) 

C;Date: 13-Sep-1996 #sequence_revision 13-Sep-1996 #text_change 02-Jun-2000 
C;Accession: 150241; JC4618 

R; Kaplan, M.H.; Smith, D.I.; Sundick, R.S. 
J. Immunol. 151, 628-636, 1993 

A; Title: Identification of a G protein coupled receptor induced in activated T 
cells . 

A; Reference number: 150241; MUID : 93329058 ; PMID: 8393036 
A;Accession: 150241 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-308 <KAP> 

A;Cross-references: GB:L06109; NID:g304383; PIDN : AAB06587 . 1 ; PID:g304384 

R;Webb, T.E.; Kaplan, M.G.; Barnard, E.A. 

Biochem. Biophys . Res. Commun. 219, 105-110, 1996 

A;Title: Identification of 6H1 as a P2Y purinoceptor: P2Y5. 

A; Reference number: JC4618; MUID : 96190677 ; PMID: 8619790 

A; Accession: JC4618 

A;Molecule type: mRNA 

A; Residues: 1-308 <WEB> 

A; Cross-references: GB:L06109; NID:g304383; PIDN : AAB06587 . 1 ; PID:g304384 
A; Experimental source: T-cells 

C; Comment: This receptor plays a role in T-cell activation. 
C; Genetics: 
A; Gene: p2Y5 

C; Superf amily : ATP receptor P2u 



C; Keywords: G protein-coupled receptor; transmembrane protein 
F; 15-4 O/Domain : transmembrane ((status predicted <TM1> 
F; 5 1-7 4 /Domain : transmembrane ((status predicted <TM2> 
F; 89-109/Domain: transmembrane ((status predicted <TM3> 
F; 133-153/Domain : transmembrane ((status predicted <TM4> 
F; 177-201/Domain: transmembrane #status predicted <TM5> 
F;227-248/Domain: transmembrane #status predicted <TM6> 
F;269-292/Domain: transmembrane {(status predicted <TM7> 



Query Match 20.2%; Score 333; DB 2; Length 308; 

Best Local Similarity 28.9%; Pred. No. 1.9e-20; 

Matches 88; Conservative 52; Mismatches 123; Indels 42; Gaps 6; 

Qy 19 KYYL-SAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIK 77 

III : : : I : | I : I : : : I : I I : I : I I : I I I I : I I I I 

Db 13 KYTLYGCVFSMVFVLGLIANCVAIYIFTFTLKVRNETTTYMLNLAISDLLFVFTLPFRIY 72 



Qy 78 SYANDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAIL 137 

: : I I I I I : : : I I : I I I I I I I I I : I I : I : : I I I I : I I : 

Db 73 YFWRNWPFGDVLCKI SVTLFYTNMYGS I LFLTCI SVDRFLAI VHPFRS KTLRTKRNARI 132 



Qy 138 ISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLC 186 

: : I I I I II : | | : : | 

Db 133 VCVAVWITVL AGSTPASFFQSTNRQNNTEQRTCFENFPESTWKT 176 



Qy 187 LTLLGFLIP — LSVMCFFYYKMW-FLKRRSQQQATALPLDKPQRLWLAW 235 

: : : I I I I hi I II: I : I I : : : : : I 

Db 177 YLSRIVIFIEIVGFFIPLILNVTC STMVLRTLNKPLTLSRNKLSKKKVLKMIFVHLV 233 



Qy 236 IFSILFTPYHIMRNLRIASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGD 295 

II I I I : I I I : I | : : : : | : | : | I : I I I : I 

Db 234 IFCFCFVPYNITLILYSmRTQTWINCSVVTAVRTMYPVTLCIAVSNCCFDPIVYYFTSD 293 

Qy 296 HYREM 300 

I : 

Db 294 TNSEL 298 



RESULT 11 
158186 

probable G protein-coupled receptor - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 26-Jul-1996 (tsequence_revision 26-Jul-1996 #text_change 21-Jul-2000 
C;Accession: 158186 

R;Harrison, J.K.; Barber, C.M.; Lynch, K.R. 
Neurosci. Lett. 169, 85-89, 1994 

A; Title: cDNA cloning of a G-protein-coupled receptor expressed in rat spinal 

cord and brain related to chemokine receptors. 

A; Reference number: 158186; MUID : 94323113 ; PMID: 8047298 

A;Accession: 158186 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-354 <RES> 

A;Cross-references: EMBL:U04808; NID : g2558635; PIDN: AAB87093 . 1 ; PID:g439861 
C; Superf amily : vertebrate rhodopsin 
C; Keywords: G protein-coupled receptor 



Query Match 20.2%; Score 332.5; DB 2; 

Best Local Similarity 27.9%; Pred. No. 2.5e-20; 
Matches 82; Conservative 66; Mismatches 125; 



Length 354; 



Indels 21; Gaps 



6; 



Qy 21 YLSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA 80 

: I I I I : : I I I I : I I : I I I : : : : : I I I I : : I I I : I I I 

Db 34 FLSIFYSLVFTFGLVGNLLWLALTNSRKSKSITDIYLLNLALSDLLFVATLPFWTHYLI 93 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISL 140 

: : I : : : I : I I : I I I : I II I : : : : I I I 

Db 94 SHEGLH-NAMCKLTTAFFFI GFFGGI FFITVI S I DRYLAI VLAANSMNNRTVQHGVTI SL 152 

Qy 141 AVWALVTLEVLPMLTFINSVPKEEGSNCI-DYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

III I I I I : : I : I I : : : : : I I I : : II : I 

Db 153 GVWAAAI LVAS P Q FMF TKRKDNECLGDYPEVLQEIWPVLRNSEVNILGFVLPLLIM 208 

Qy 200 CFFYYKMV-VFLKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLDS 258 

I I : : : I : : : : : | | : I I : : I I I : I : : I I I : I : I : 

Db 209 SFCYFRIVRTLFSCKNRKKARAI RLILLWWFFLFWTPYNIVIFLETLKFYNF 262 

Qy 259 WPQGCTQKAIKS I YTLTRPLAFLNSAINPI FYFLMGDHYREMLI SKFRQYFKSL 312 

: I : : : : : : I : I I : : I I I I : I I I : I ' : I 

Db 2 63 FPSCGMKRDLRWALSVTETVAFSHCCLNPFIYAFAGE KFRRYLRHL 308 



RESULT 12 
JC4800 

P2Y6 receptor - human 

C; Species: Homo sapiens (man) 

C;Date: 15-Oct-1995 #sequence_revision 16-Aug-1996 #text_change 17-Nov-2000 
C;Accession: JC4800; G02514 

R;Communi, D. ; Parmentier, M. ; Boeynaems, J.M. 
Biochem. Biophys . Res. Commun. 222, 303-308, 1996 

A; Title: Cloning, functional expression and tissue distribution of the human 
P2Y6 receptor. 

A; Reference number: JC4800; MUID: 96222498 ; PMID: 8670200 

A; Accession: JC4 8 00 

A; Molecule type: mRNA 

A; Residues: 1-328 <COM> 

A;Cross-references : EMBL:X97058 

A; Experimental source: placenta 

R;Hammet, F. ; Southey, M.C.; Somers, G.R.; Hutchins, A.M.; Venter, D.J. 
submitted to the EMBL Data Library, March 1996 
A; Reference number: H01373 
A;Accession: G02514 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 'M 1 , 4-328 <HAM> 

A; Cross-references: EMBL:U52464; NID: gl407632 ; PIDN : AAB03572 . 1 ; PID:gl407633 
C; Genetics : 
A; Gene: P2Y6 

C; Superf amily : ATP receptor P2u 

C; Keywords: glycoprotein; placenta; receptor; transmembrane protein 
F; 26-52/Domain : transmembrane ffstatus predicted <TMM1> 
F; 63-86/Domain: transmembrane #status predicted <TMM2> 
F; 104-122/Domain : transmembrane #status predicted <TMM3> 
F; 143-167/Domain : transmembrane #status predicted <TMM4> 



F; 193-216/Domain: transmembrane #status predicted <TMM5> 
F; 241-264/Domain : transmembrane #status predicted <TMM6> 
F; 283-305/Domain: transmembrane #status predicted <TMM7> 
F;5, 173/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 20.1%; Score 332; DB 2; Length 328; 

Best Local Similarity 30.6%; Pred. No. 2.5e-20; 

Matches 90; Conservative 48; Mismatches 136; Indels 20; Gaps 6; 

Qy 22 LSAFYAIEFIFGLLGNVTWFGYLFCMKNWNSSNVYLFNLSISDFAFLCTLPILIKSYA- 80 

I I : II I = I : : : I I I I : : : I : I : I i : I I : M 

Db 29 LPPVYSAVLTyVGLPLNICVITQICTSRRALTRTAVYTLNIALADLLYACSLPLLIYNYAQ 88 

Qy 81 NDKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFRE-HFLQKKEFAILIS 139 

I : I I I I : : : E I : I I I I I I I I I I I : : I I : I I : 

Db 89 GDHWPFGDFACRLVRFLFY7\NLHGSILFLTCISFQRYLGICHPLAPWHKRGGRRAAWLVC 148 

Qy 140 LAVWALVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVM 199 

: I I I I I : I I I : : : I I : I : I : I I : : I I I : I : : 

Db 14 9 VAWIxA.VTTQCLPTAIFAATGIQRNRTVCYDLSPP7VLATHYMPYGMALTVIGFLLPFAAL 2 08 

Qy 200 CFFYYKMWFLKRRSQQQATALPL DKPQRLVVLAVVI FS I LFT P YHIMRNLRI AS 254 

I : I : I II: I I : I : I : I I I : I I : : I 

Db 209 LACYCLLAC RLCRQDGPAEPVAQERRGKAARMAVWAAAFAISFLPFHITKTAYLAV 265 

Qy 255 RLDSWPQGCTQKAI KS I YTLTRPLAFLNSAINP I - FYFLMGDHYREMLI SKFRQ 307 

I : I : I III! I I :: I I I I I III: 
Db 2 66 RSTPGVPCTVLEAFAAAYKGTRPFASANSVLDPILFYFTQ KKFRR 310 



RESULT 13 
A45177 

chemokine (C-C) receptor 1 - human 

N;Alternate names: C-C CKR-1; macrophage inflammatory protein- 1-alpha receptor 
C; Species: Homo sapiens (man) 

C;Date: 30-Sep-1993 #sequence__revision 30-Sep-1993 #text_change 13-Aug-1999 
C;Accession: A45177; 155671 

R;Neote, K. ; DiGregorio, D.; Mak, J.Y.; Horuk, R. ; Schall, T.J. 
Cell 72, 415-425, 1993 

A;Title: Molecular cloning, functional expression, and signaling characteristics 
of a C-C chemokine receptor. 

A; Reference number: A45177; MUID: 93161416; PMID: 7679328 
A;Accession: A45177 

A; Status : nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-355 <NEO> 

A; Cross-references: GB:L10918; NID:g292416; PIDN : AAA36543 . 1 ; PID:g292417 
A; Experimental source: HL60 cells 

A;Note: sequence extracted from NCBI backbone (NCBIP : 124876) 
R;Gao, J. 

J. Exp. Med. 177, 1421-1427, 1993 

A; Title: Structure and functional expression of the human macrophage 
inflammatory 1 alpha (MlP-lalpha) /RANTES receptor. 
A; Reference number: 155671; MUID: 93240122 ; PMID:7683036 
A;Accession: 155671 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 



A; Residues: 1-355 <RES> 

A;Cross-references: GB:L10918; NID:g292416; PIDN : AAA36543 . 1; PID:g292417 
C; Genetics : 

A; Gene: GDB : CMKBRl ; CMKR-1 

A;Cross-references : GDB: 138446; OMIM: 601159 

A;Map position: 3p21-3p21 

C; Super family : vertebrate rhodopsin 

C; Keywords: disulfide bond; G protein-coupled receptor; glycoprotein; 
phosphoprotein; transmembrane protein 

F; 36-60/Domain : transmembrane #status predicted <TM1> 
F; 71-91/Domain : transmembrane jfstatus predicted <TM2> 
F; 10 8- 12 9/ Domain : transmembrane #status predicted <TM3> 
F; 147-171/Domain: transmembrane ftstatus predicted <TM4> 
F; 2 05-2 23 /Domain: transmembrane ((status predicted <TM5> 
F;240-264/Domain: transmembrane ((status predicted <TM6> 
F; 288-305/Domain: transmembrane #status predicted <TM7> 
F;5/Binding site: carbohydrate (Asn) (covalent) ((status predicted 
F;24-273, 106-183/Disulf ide bonds: ((status predicted 

F; 34 5 /Binding site: phosphate (Ser) (covalent) (by casein kinase II) ftstatus 
predicted 

Query Match 19.9%; Score 328; DB 2; Length 355; 

Best Local Similarity 29.0%; Pred. No. 5.9e-20; 

Matches 85; Conservative . 63; Mismatches 127; Indels 18; Gaps 8; 

AFYAI EFI FGLLGN VTVVFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS YAN 81 

I : : I : I I : I I : I I : : I : : : I I I I : I I I I I I I I I 
1 PL YS LVFVI GLVGN I LWLVLVQYKRLKNMTS I YLLNLAI S DLL FLFTL P FWI DYKLK 96 

:GT YGDVLC — I SNRYVLHTNLYTS I LFLTFI SMDRYLLMKYPFREHFLQKKEFAI LI S 139 
: I I : I : I I : I I I : I I : : : : I I I I : : : I : : I 



I I I I :| I I : : I : : I I :| I I I ::| I 



Qy 


22 


Db 


37 


Qy 


82 


Db 


97 


Qy 


140 


Db 


155 


Qy 


199 


Db 


215 


Qy 


258 


Db 


268 



'LKRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNLRIASRLD- 257 
III : : : I I I : : : : I I : : I I I : : : I : I 



II:: : I : I : : : I I : I : I : : I : | III 
1CEQS RHLDLAVQ VTEVT AYTHCCVN PVI YAFVGERFRKYL RQLF 316 



RESULT 14 
JC4587 

chemokine (C-C) receptor 4 - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 08-Mar-1996 #sequence_revision 19-Apr-1996 (ftext_change 20-Jun-2000 
C; Accession: JC4 587 

R;Hoogewerf, A.J.; Black, D. ; Proudfoot, A.E.I. ; Wells, T.N.C.; Power, C.A. 
Biochem. Biophys. Res. Commun. 218, 337-343, 1996 

A; Title: Molecular cloning of murine CC CKR-4 and high affinity binding of 
chemokines to murine and human CC CKR-4 . 

A; Reference number: JC4587; MUID : 96136324 ; PMID:8573157 



A; Accession: JC4587 
A; Molecule type: mRNA 
A; Residues: 1-360 <HOO> 

A; Cross-references: EMBL:X90862; NID : gll67851 ; PIDN : CAA62372 . 1 ; PID:gll67852 
A; Experimental source: thymus 

C; Genetics: 1 
A; Gene : cc ckr-4 

C; Superf amily : vertebrate rhodopsin 

C; Keywords: glycoprotein; phosphoprotein; receptor; thymus 

F; 2 , 183, 194/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 72 , 202, 350/Binding site: phosphate (Ser) (covalent) (by casein kinase II) 
#status predicted 

F; 145/Binding site: phosphate (Ser) (covalent) (by protein kinase C) #status 
predicted 

F; 321/Binding site: phosphate (Thr) (covalent) (by protein kinase C) #status 
predicted 

Query Match 19,8%; Score 326.5; DB 2; Length 360; 

Best Local Similarity 28.4%; Pred. No. 8e-20; 

Matches 93; Conservative 57; Mismatches 136; Indels 41; Gaps 10; 

Qy 14 EAILNKYY LSAFYAIEFI FGLLGNVTWFGYLFCMKNWN 52 

1:111 I I : : I : I I I I I I III 

Db 13 ETVYNSYYFYESMPKPCTKEGIKAFGEVFLPPLYSLVFLLGLFGNSVWL-VLFKYKRLK 71 

Qy 53 S-SNWLFNLSISDFAFLCTLPILIKSYANDKGTYGDVLCISNRWLHTNLYTSILFLTF 111 

I : : I I I I I : I I I I : : I I Ml: : I II : : I : I I : 

Db 72 SMTDVTLLNLAISDLLFVLSLPFW-GYY7W)QWVFGLGLCKIVSWMYLVGFYSGIFFIML 130 

Qy 112 ISMDRYLLMKYPFREHFLQKKEFAILISLAVWALVTLEVLPMLTFINSVPKEEGSNCIDY 171 

: I : I I I I : : : : : : I I i : : I I I I : : I 

Db 131 MSIDRYLAIVHAVFSLKARTLTYGVITSLITWSVAVFASLPGLLFSTCYTEHNHTYCKTQ 190 

Qy 172 ASSGNPEHNLIYSLCLTLLGFLIPLSVMCFFYYKMWFLKRRSQQQATALPLDKPQRLW 231 

I : : : I I : : I I I I I I : I hi h: I : I : : I : : 

Db 191 YSVNSTTWKVLS SLEINVLGLLI PLGIM-LFWYSMI I RTLQHCKNEKKNRAVRMIF 245 

Qy 232 LAWIFSILFTPYHIMRNLRIASRLDSWPQGCT-QKAIKSIYTLTRPLAFLNSAINPIFY 290 

I I : I : I I I : : : I I : III::: I I I : : : I I : I 

Db 24 6 GVWLFLGFWTPYNWLFLETLVELEVL-QDCTLERYLDYAIQATETLGFIHCCLNPVIY 304 

Qy 291 FLMGDHYREMLI S KFRQYFKS LT S FRT 317 

I : I : I I I : I I III 

Db 305 FFLGE KFRKYITQL— FRT 321 

RESULT 15 
JN0605 

somatostatin receptor 4 - human 
C; Species: Homo sapiens (man) 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 20-Jun-2000 

C; Accession: JN0605; JN0762; A47457 

R;Xu, Y. ; Song, J.; Bruno, J.F.; Berelowitz, M. 

Biochem. Biophys . Res. Commun. 193, 648-652, 1993 

A; Title: Molecular cloning and sequencing of a human somatostatin receptor, 
hSSTR4. 

A; Reference number: JN0605; MUID : 93290656; PMID: 8512564 



A; Accession: JN0605 
A; Molecule type: DNA 
A; Residues: 1-38 8 <XUY> 

A; Cross-references : GB:L14856; NID:g292499; PIDN : AAA36623 . 1 ; PID:g292500 
R;Yamada, Y. ; Kagimoto, S.; Kubota, A.; Yasuda, K. ; Masuda, K. ; Someya, Y. ; 
Ihara, Y. ; Li, Q. ; Imura, H.; Seino, S.; Seino, Y. 
Biochem. Biophys . Res. Commun. 195, 844-852, 1993 

A; Title: Cloning, functional expression and pharmacological characterization o 
a fourth (hSSTR4) and a fifth (hSSTR5) human somatostatin receptor subtype. 
A; Reference number: JN0762; MUID : 933 84 611 ; PMID: 8373420 
A; Accession: JN0762 
A; Molecule type: DNA 
A; Residues: 1-388 <YAM> 

A; Cross-references : GB:D16826; NID:g693907; PIDN ; BAA04 106 . 1 ; PID:g693908 
R;Rohrer, L. ; Raulf, F . ; Bruns, C; Buettner, R. ; Hofstaedter, F. ; Schule, R. 
Proc. Natl. Acad. Sci. U.S.A. 90, 4196-4200, 1993 

A; Title: Cloning and characterization of a fourth human somatostatin receptor. 
A; Reference number: A47457; MUID : 93248256 ; PMID: 8483934 
A;Accession: A47457 
A; Molecule type: DNA 

A;Residues: 1-82, ' T 1 , 84-364 , 1 K 1 , 366-388 <ROH> 

A;Cross-references : GB:L07833; NID:g307429; PIDN : AAA60565 . 1 ; PID:g307430 

A;Note: sequence extracted from NCBI backbone (NCBIN : 130856, NCBIP : 1308 58 ) 

C; Comment: This protein mediates the diverse actions of the tetradecaptide 

somatostatin . 

C; Genetics : 

A; Gene: GDB : SSTR4 

A; Cross-references : GDB:202662; OMIM: 182454 
A; Map position: 2 Opll . 2-2 Opll . 2 
A;Introns: #status absent 
C; Superf amily : vertebrate rhodopsin 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 

lipoprotein; phosphoprotein; thiolester bond; transmembrane protein 

F; 47-73/Domain: transmembrane jjstatus predicted <TM1> 

F; 84-109/Domain: transmembrane #status predicted <TM2> 

F; 12 1-14 2 /Domain : transmembrane #status predicted <TM3> 

F; 162-184/Domain : transmembrane #status predicted <TM4> 

F; 2 08-238/Domain : transmembrane #status predicted <TM5> 

F; 257-284/Domain : transmembrane #status predicted <TM6> 

F; 291-314/Domain : transmembrane ((status predicted <TM7> 

F;24/Binding site: carbohydrate (Asn) (covalent) #status predicted 

F; 119-198/Disulf ide bonds: ftstatus predicted 

F; 161, 253/Binding site: phosphate (Ser) (covalent) (by cAMP-dependent kinase) 
#status predicted 

F; 327/Binding site: palmitate (Cys) (covalent) #status predicted 

Query Match 19.7%; Score 325.5; DB 2; Length 388; 

Best Local Similarity 28.4%; Pred. No. le-19; 

Matches 82; Conservative 66; Mismatches 110; Indels 31; Gaps 5 

Qy 22 LSAFYAI EFI FGLLGNVTWFGYLFCMKNWNS SNVYLFNLS I SDFAFLCTLPI LI KS YAN 81 

: II: : I I : I I I : I I I : : I : I I I I : : : I I : : : I : II 
Db 50 I Q C I YALVC LVGLVGNALVT FVT L RYAKMKT AT N I Y L LN LAVAD E L FML S VP FVAS S AAL 109 

Qy 82 DKGTYGDVLCISNRYVLHTNLYTSILFLTFISMDRYLLMKYPFREHFLQKKEFAILISLA 141 

: I I I I : I I : : I I : I I : I : I II : : : I I : : I I I : I 

Db 110 RHWPFGSVLCRAVLSVDGLNMFTSVFCLTVLSVDRYVAWHPLRAATYRRPSVAKLINLG 169 



Qy 142 VW7VLVTLEVLPMLTFINSVPKEEGSNCIDYASSGNPEHNLIYSLCLTLLGFLIPLSVMCF 201 

II I II: I : : I I : I : : : : I I I I I : I : : 

Db 170 VWLASLLVTLPIAIFADTRPARGGQAVACNLQWPHPAWSAVFVVYTFLLGFLLPVLAIGL 229 

Qy 202 FYYKMV VFL KRRSQQQATALPLDKPQRLWLAWIFSILFTPYHIMRNL 250 

I : I II : | | | : :: | III:: I I : I : : I : : : : : I 

Db 230 CYLLIVGKMRAVALRAGWQQRRRSEKKIT RLVL^4VVWFVLCWMPFYWQLL 281 

Qy 251 R-IASRLDSWPQGCTQKAIKSIYTLTRPLAFLNSAINPIFYFLMGDHYR 298 

: : I I : : : : : I : : I I I I I I : I : : I 

Db 2 82 NLWTSLDA TVNHVSLILSYANSCANPILYGFLSDNFR 319 



Search completed: December 12, 2003, 18:09:05 
Job time : 21 sees 



